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I investigate statistical properties of one-dimensional fields in the universe such as the 
Lja forest (Lya absorptions in the quasar spectrum) and inverted line-of-sight densities. 
The Lya forest has opened a great window for studying the large-scale structure of the 
universe, because it can probe the cosmic density field over a wide range of redshift 
at relatively high resolution, which has not been easily accessible with other types of 
observations. 

The power spectrum completely characterizes Gaussian random fields. However, be- 
cause of gravitational clustering, the cosmic density field is already quite non-Gaussian on 
scales below 10 h~^Mpc at redshift z = 3. I analyze the covariance of the one-dimensional 
mass power spectrum, which involves a fourth-order statistic, the trispectrum. The co- 
variance indicates that Fourier modes in the cosmic density field are highly correlated and 
that the variance of the measured one-dimensional mass power spectrum is much higher 
than the expectation for Gaussian random fields. It is found that rare high-density struc- 
tures contribute significantly to the covariance. The window function due to the length 
of lines of sight introduces additional correlations between different Fourier modes. 

In practice, one observes quasar spectra instead of one-dimensional density fields. As 
such, fiux power spectrum has been the basis of many works. 1 show that the nonlinear 
transform between density and flux quenches the fluctuations so that the flux power 
spectrum is less sensitive to cosmological parameters than the one-dimensional mass 
power spectrum. The covariance of the flux power spectrum is nearly Gaussian, which 
suggests that higher-order statistics may be less effective for the flux. 

Finally, I provide a method for inverting Lya forests and obtaining line-of-sight den- 
sities, so that statistics can be measured from one-dimensional density fields directly. 
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CHAPTER 1 
INTRODUCTION 

The cosmological principle, i.e. the assumption of homogeneity and isotropy (Milne 1935), 
has been born out by decades of tests (for reviews, see Peebles 1980, 1993). As profound 
as it is, structures do rise from minute primordial fluctuations because of gravitational 
instability (e.g. Jeans 1928; Lemaitre 1934). On scales beyond galaxy clusters, the cosmic 
density field remains in the linear or quasi-linear regime, which means the formation and 
evolution of the large-scale structure still bears the initial imprint of cosmology. In other 
words, the large-scale structure is a sensitive tool for measuring fundamental properties 
of the universe. 

Galaxies and clusters of galaxies were the first objects used to study the large-scale 
structure of the universe (e.g. Hubble 1934; Shapley 1934). Recent surveys include the 
Center for Astrophysics Survey (Huchra et al. 1990), the Automatic Plate-measuring 
Machine Survey (Maddox et al. 1990), the Las Campanas Redshift Survey (Shectman 
et al. 1996), the Infrared Astronomical Satellite Point Source Catalog Redshift Survey 
(Saunders et al. 2000), the Two Degree Field (2dF) Galaxy Redshift Survey (Colless et 
al. 2001), and the Sloan Digital Sky Survey (SDSS, initial release, Stoughton et al. 2002). 
One useful statistic is the galaxy clustering power spectrum (PS, Peacock 1991; Baugh 
& Efstathiou 1993; Fisher et al. 1993; Lin et al. 1996; Percival et al. 2001; Dodelson 
et al. 2002). Because galaxies reside in high-density peaks of the cosmic density field, 
galaxy statistics may be (slightly) biased with respect to the true statistics of the field 
(Dekel & Silk 1986; Gaztanaga & Prieman 1994; Kauffmann, Nusser & Steinmetz 1997; 
Dekel & Lahav 1999; Benson et al. 2000; Peacock & Smith 2000; Somerville et al. 2001; 
Verde et al. 2002; Weinberg et al. 2004). Galaxy surveys are often limited by depth. For 
example, the mean redshift of the above-mentioned surveys is around 0.1 or less, which 
translates to a volume on the order of a thousandth of the Hubble volume. Such a small 
survey volume limits our ability to study the large-scale structure at high redshift and 
its evolution. 

Quasars, or quasi-stellar objects (QSOs), are far more luminous than galaxies, and 
they have been observed near the edge of the visible universe (Fan et al. 2001, 2003). The 
2dF QSO survey (Boyle et al. 2000) and the SDSS will provide large QSO samples for 
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statistics such as the QSO clustering correlation function and PS (Shaver 1984; Shanks 
et al. 1987; lovino & Shaver 1988; Mo & Fang 1993; Shanks & Boyle 1994; La Franca, 
Andreani & Cristiani 1998; Outram et al. 2003). Like galaxies, QSOs also live in high- 
density peaks of the universe, and their statistics may be biased as well. In fact, galaxies 
are often found to cluster around QSOs (Tyson 1986; Fisher et al. 1996; Pascarelle et 
al. 2001; Kauffmann & Haehnelt 2002). Because of large separations, QSOs statistics 
are more reliable for studying the clustering of matter above 10 h~^Mpc, where h is the 
Hubble constant in units of 100 km s~-^ Mpc~^. 

One often sees numerous Lya absorption lines in QSO spectra that are resulted from 
absorptions by the diffusely distributed and photoionizcd intcrgalactic medium (IGM). 
Absorption systems that have neutral-hydrogen column density 10-*^^ < A^hi < 10^^ cm~^ 
arc also called the Lya forest (for a review, sec Rauch 1998). It has been demonstrated 
by various hydrodynamical simulations that the IGM traces the density of the underlying 
mass field on large scales (Cen et al. 1994; Zhang, Anninos & Norman 1995; Hernquist et 
al. 1996; Miralda-Escude et al. 1996; Dave et al. 1997; Bryan et al. 1999). In other words, 
baryon densities in Lya absorption systems are roughly proportional to total matter 
densities for p/p < 10 (Bi & Davidsen 1997; Gnedin & Hui 1998; Zhang et al. 1998), 
where p is the density and p is the mean density. Therefore, the Lya forest offers a less 
biased probe of the large-scale structure of the universe. 

The Lya forest has been observed up to 2; ~ 6 (Becker et al. 2001), beyond which 
neutral hydrogen (Hi) completely absorbs the QSO flux at rest wavelength Aq = 1216 A 
due to the Gunn-Peterson effect (Gunn &; Peterson 1965). For each line of sight (LOS) to 
a QSO, one can sample the density field almost continuously in one dimension and obtain 
more information than the clustering of QSOs. With enough LOS's covering < z < 6, 
the Lya forest will enable us to establish a more complete picture of the universe and 
its evolution. In fact, statistics of the Lya forest have been applied to many aspects 
of large-scale structure studies such as recovering the initial linear mass PS (Croft et al. 
1998, 1999; Hui 1999; Feng & Fang 2000; McDonald et al. 2000; Gnedin & Hamilton 2002; 
Zaldarriaga, Scoccimarro &: Hui 2003), measuring the flux PS and bispectrum (Hui et al. 
2001; Mandclbaum et al. 2003; Kim et al. 2004; Viel et al. 2004), estimating cosmological 
parameters (McDonald & Miralda-Escude 1999; Zaldarriaga, Hui & Tegmark 2001; Croft 
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et al. 2002; Seljak, McDonald & Makarov 2003; McDonald 2003; Viel et al. 2003), inverting 
the Lya forest (Nusser & Haehnelt 1999; Pichon et al. 2001; Ztian 2003), finding the 
applicable range of the hierarchical clustering model (Feng, Pando & Fang 2001; Zhan, 
Jamkhedkar & Fang 2001), and estimating the velocity field (Zhan Sz Fang 2002). These 
studies show that the Lja forest has provided an important complement to studies based 
on galaxy and QSO samples. 

The unique nature of the Lya forest brings its own subtleties. First, the Lja for- 
est probes well into the nonlinear regime, e.g. from several h~^Mpc down to tens of 
h-^kpc. The non-Gaussianity on such small scales plays an important role in statistics, 
and cosmological hydrodynamical simulations are needed for the study. Second, the di- 
rect observable is the Lya flux rather than the density. Statistics of the density, which 
are more fundamental to cosmology, have to be recovered from flux statistics. Third, 
the observed Lya forest may be affected by many elements such as the continuum fitting 
(Hui et al. 2001), the UV ionization background (Ranch et al. 1997; Scott et al. 2000; 
McDonald &: Miralda-Escude 2001; Meiksin & White 2004), saturated Lya absorptions 
(Zhan 2003; Viel et al. 2004), metal-line contaminations, and redshift distortion. At 
low redshift, additional uncertainties arise from the thinning of the Lya forest (Riediger, 
Petitjean & Miicket 1998; Theuns, Leonard &; Efstathiou 1998) and the shock-heated 
warm-hot IGM (WHIM, Dave et al. 1999; Dave & Tripp 2001; Dave et al. 2001). 

This dissertation focuses on theoretical and numerical aspects of one-dimensional 
statistics of the large-scale structure. It is organized as follows. Chapter 2 discusses the 
relation between the one-dimensional PS and the three-dimensional PS with an emphasis 
on spatial average. The covariance of the one-dimensional mass PS is derived for Gaus- 
sian random fields (GRFs) and the cosmic density field in Chapter 3. One-dimensional 
mass PS's and their covariances are calculated in Chapter 4 for simulated density fields. 
Flux PS's and their covariances are measured from simulated as well as observed Lya 
forests in Chapter 5, and they are compared with corresponding mass PS's and covari- 
ances. Effects of the UV ionization background and the WHIM arc also discussed there. 
Chapter 6 introduces the mean density-width relation (Zhan 2003) and its application 
in extracting density information from saturated Lya absorptions. Chapter 7 concludes 
the dissertation. 



CHAPTER 2 
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ONE-DIMENSIONAL POWER SPECTRUM 

The one-dimensional mass PS and its relation to the three-dimensional mass PS have 
been frequently utilized to recover the linear mass PS from the Lyct forest (Croft et al. 
1998, 1999, 2002; Gnedin & Hamilton 2002). This opens a great window for studying the 
large-scale structure of the universe over a wide range of redshift. 

For an ensemble of isotropic fields, the one-dimensional mass PS PiY){k) is a simple 
integral of the three-dimensional mass PS P{k) (Lumsden, Heavens &; Peacock 1989), 

1 r°° 

Pm{k) = — P{k')k'dk', (2.1) 
27r Jk 

where k is the LOS wavenumber. However, there is only one observable universe. The 
ensemble average has to be replaced by a spatial average. For instance, one may sample 
multiple LOS densities from the three-dimensional cosmic density field and use the aver- 
age PS of the one-dimensional densities in place of the ensemble-average quantity. Such 
spatial average alters the relation between the one-dimensional mass PS and the three- 
dimensional mass PS. I discuss the modification to equation (2.1) and its significance in 
this chapter. Although I use the density field as an example, the results obtained here 
are generic. 

2.1 Three-Dimensional Power Spectrum 

It is beneficial to review the three-dimensional PS before we look at the one-dimensional 
PS. I assume the following convention of Fourier transforms for a cubic density field /9(x) 
in the volume V = B^: 

^ oo 

'^W = ^ E ^(n)e2-»-/^, (2.2) 

n= oo 

<5(n) = ! (5(x) dx, 
Jv 

where 5(x) = [/5(x) — p\/p is the overdensity, (5(n) is the Fourier transform of (5(x), the 
summation Yl<'^=-oo abbreviation for and the wavevector k equals 

27rn/iJ with integral ni, n2, and n^. I choose this form of Fourier transforms because real 
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surveys always have a finite volume and, after all, the visible universe is finite. Equation 
(2.2) will approach the limit of continuous Fourier transforms as the volume increases. 

With the understanding that Fourier modes exist only at discrete wavenumbers, 
i.e. ni, n2, and ns are integers, one may use k and n interchangeably for convenience. 
To be complete, the orthonormality relations are 

1 |^e-^-("-"')-/^dx = j5„„ (2.3) 

-. oo 

^ ^ g2.m.(x-x')/B ^ ^D(^_^/^^ (2.4) 

n=— oo 

where 6^^, is the three-dimensional Kronecker delta function, and d^{x. — x') the Dirac 
delta function. 

The three-dimensional mass PS of the universe is defined through 

(5(k)r(k')) = P(k)y<5^,„„ 

where (. . .) stands for an ensemble average. One may define an observed PS, 

P'(k) = mf/V, (2.6) 

so that -P(k) = (P'(k)). For an isotropic field, P(k) is a function of the length of k 
only, i.e. P(k) = P{k). A shot- noise term should be included if the PS is measured 
from discrete objects, and it is inversely proportional to the mean number density of the 
objects. I neglect the shot-noise in this dissertation. An alternative definition of P{k) 
utilizes the correlation function ^(r), e.g. 

^(r) = (<5(x + r)<5(x)), (2.7) 

P{k) = f ^{r)e-'^-''dr. (2.8) 
Jv 

The four-point function of 5(x) is 

(5(Xa)(5(xb)(^(Xc)(5(xd)) = ^ab^cd + Cac^bd + ^ad^bc + V, (2-9) 

where ^ab stands for C(|xa — Xb|), and rj is the reduced four-point correlation function 
that has six degrees of freedom arising from relative coordinates of the four points under 



(2.5) 
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the constraint of homogeneity (Peebles 1980). Prom equations (2.8) and (2.9), we have 
the four-point function of ^(k) 

(^(ka)5*(kb)(5(ke)^*(kd)) = ^'[P(A:a)P(A;c)5:^,,„,Cn, +^(A;a)P(A;b)<5:?,,„/:^„nJ 

+yr(ka,-kb,ke,-kd), (2.10) 

where 

r(ka, kb, kc, kd) = J r] exp[-ika • Xa - ikb • Xb - ike • Xc - ikd • Xd] dxa dxb dxc dxd 

is the trispectrum, and the wavevectors are restricted to be in the same hemisphere so 
that the term P{ka,)P{h))S^^ _n^S^^ _n^ does not appear. Since the reduced four-point 
correlation function r/ is a six- variable function, there is a redundancy in the variables 
of the trispectrum. It is evident from the four-point function that the covariance of the 
three-dimensional mass PS is (see also Meiksin & White 1999; Cooray & Hu 2001) 

a\ky) = ([P'(k) - P(k)][P'(k') - P(k')]) = (^(k)5*(k)^(k')r(k'))F-2 - P{k)P{k') 
= PHk)S^,n' + V-'Tik, -k, k', -k'). (2.11) 
For GRPs, the trispectrum vanishes, and 

a\k,k')=P\k)S^^^,. (2.12) 

Since the survey volume is always smaller than the observable universe, the covariance 
will be modified according to Peldman, Kaiser & Peacock (1994), and the non-Gaussianity 
will further alter the covariance through the trispectrum. 

2.2 One-Dimensional Power Spectrum 

Por a LOS density that is along the a;3-axis and sampled at (xi,X2), the one-dimensional 
Pourier transform gives 

^(x±,n3)=/ (^(x) e-2™3/^ dxa = -2 V <5(n^, ng) e^--^'"^/^, (2.13) 

•^0 ^ nx=-oo 

where the subscript ± signifies the first two components of a vector, i.e. = {xi,X2)- 
Similar to the three-dimensional mass PS, the one-dimensional mass PS is expected to 
follow 

{6{^^,ns)~d*{^^,n',)) = PMm)BS^^y^. (2.14) 
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Substituting equation (2.13) in equation (2.14) and making use of equation (2.5), one 
finds the relation between the one-dimensional mass PS and the three-dimensional mass 
PS, 

oo 

AD(n3) = -^ ^ P{n±,ns), (2.15) 

nj_=— oo 

which is a discrete analog of equation (2.1). 

Practically, one measures PS's of LOS densities sampled at some locations, e.g. 

PU^±,h) = \6{^^,h)\''/B. (2.16) 

A simple estimator of the one-dimensional mass PS may be constructed by a spatial 
average over many LOS's, i.e. 

PUh) = {PU^±,ks))^^, (2.17) 

where (. . .)xj^ stands for a spatial average over all LOS's in the sample. To assess the 
performance of the estimator, two questions need to be addressed: (1) How does P{q(A;3) 
relate itself to P'(k); and (2) What is the covariance of Pij^iks) with respect to Pid(A;3)? 
The rest of this chapter answers the former, and Chapter 3 the latter. 

For simplicity, I assume that LOS's are sampled regularly in transverse directions 
at an interval of 6 = B/m, where m is an integer. Each LOS has a length of B, and 
x_L = bl± with Zi, Z2 = 0, . . . , m — 1. The estimated one-dimensional mass PS is then 

^ m—l 

lx=0 
-. m—l 00 

= J^E| E ^(n.,n3)e— 

lx=0 nx=— 00 



1 ^ 

^ ^ E ^{^±^ri3)S*{n±+mj±,n3), (2.18) 

nx,jx=-oo 

where the equality, 

^ m—l 

^2..(n-„'W- = JK^,^^^. ^■ = 0,±l,...,±OO, (2.19) 



m 
1=0 



has been used to obtain the last line of equation (2.18). It is easy to show with equation 
(2.18) that 

Pmiks) = {PUh)) = (P{D(x±,fc3)). (2.20) 
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Note that there is no spatial average on the far right side of equation (2.20). It seems 
that only in the limit m — oo do P[j){k3) and P'(k) follow the same relation as equations 
(2.1) and (2.15), i.e. ^10(^:3) = J2n^=-oo -P'(k±, k3)/B^, where I have assumed ^(00) = 
so that only = (0,0) terms contribute. In other words, one would need a high sampling 
rate in the transverse direction to accurately recover P'(k) from P[^{ks) regardless the 
scale of interest. This apparently contradicts the new sampling paradigm^ (Unser 2000), 
according to which one only has to prefilter signals and sample them at a minimum rate 
of twice the frequency (wavenumber) of interest. However, the one-dimensional mass PS 
is a projection of the three-dimensional mass PS, so that even a prefiltering of transverse 
modes on small scales will lead to a distortion of the measured one-dimensional mass PS 
on large scales. 

2.3 Aliasing 

Without losing generality, one may choose m to be even, so that equation (2.18) can be 
re-arranged as 

m/2 

PUn3) = ^ J2 ^a(nx,n3) (2.21) 

nx=— m/2 



^ 00 

-2 Yl P'{^±,n3) + A{n3), 
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nx=— 00 



where 



1 I °° 2 
^'a(n±,n3) = -| Yl H^±+'ni±^n3) , (2.22) 



J±=-oo 



and 



^ 00 Tn/2 

A{ns) = ^ J2 J2 S{n^+mi±,n3)S*{n^+mi±+mj'^,n3). (2.23) 

nx=— m/2 



JX:JX= -°° 

j'x^O 

Equation (2.21) provides two equivalent views of the sampling effect. On one hand, the 
one-dimensional mass PS is a sum of the aliased three-dimensional mass PS, Pa(k), that 



major difference between the classic sampling theorem (Shannon 1949) and the new sampling 

paradigm is that the former assumes bandhmitod signals while the latter has relaxed such assumption 
and developed a variety of prefiltering, sampling, and reconstruction techniques. 
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is sampled by a grid of m x m LOS's. On the other hand, it is a complete sum of the 
underlying three-dimensional mass PS with an extra term ^(^3) that is determined by 
the sampling rate and properties of the density field. 

The discrete Fourier transform cannot distinguish a principal mode at < /cNyq from 
its aliases at A; it 2A;Nyq, k it 4A;Nyq, • ■ -, where k^yq is the sampling Nyquist wavenumber. 
For example, ^Nyq = 7r/6 if one samples the field at an equal spacing of b. The alias 
modes will be added to the principal mode if they are not properly filtered out before 
sampling (for details, see Hockney & Eastwood 1981). Since the cosmic density field 
is not bandlimited, aliasing can distort statistics of the field. The distortion on the 
three-dimensional mass PS cannot be quantified a priori, because it depends on relative 
phases between the principal mode and its aliases. The alias effect is less pronounced, if 
amplitudes of the alias modes arc much smaller than that of the principal mode. Since the 
three-dimensional mass PS decreases towards small scales, a high sampling rate (or k^yq) 
can suppress aliasing for modes with k ^ /^Nyq- Alternatively, one can use anti-aliasing 
filters to reduce the alias effect (for instance, with wavelets Fang & Feng 2000). 

Aliasing occurs in the estimated one-dimensional mass PS because the continuous 
density field contains significant Fourier modes at wavenumbers that are greater than 
the sampling Nyquist wavenumber in xi and X2 directions. This is evident in equa- 
tion (2.22). Unlike the three-dimensional case, prefiltering small-scale transverse modes 
may not improve the estimated one-dimensional mass PS, because each mode of the 
theoretical one-dimensional mass PS contains contributions of transverse modes in the 
three-dimensional cosmic density field on all scales. 

If the Fourier modes of the density field are uncorrelated, the term A{ks) may ne- 
glected for a finite number of LOS's and, therefore, validate equation (2.15). Strictly 
speaking, ^(^3) vanishes only as an ensemble average over many GRFs, but since there 
are so many independent modes in a shell of radius around k, especially at large wavenum- 
bers, the summation in ^(^3) will tend to vanish even for a single GRF. The cosmic 
density field is more Gaussian at higher redshift, so equation (2.15) may be a good ap- 
proximation then. At low redshift, however, one might only be able to recover a heavily 
aliased three-dimensional mass PS from a sparse sample of LOS's. 



CHAPTER 3 



23 



COVARIANCE OF THE ONE-DIMENSIONAL POWER 
SPECTRUM: GAUSSIAN RANDOM FIELDS 



As one starts to attempt precision cosmology using the Lya forest (Croft et al. 2002; 
Mandelbaum et al. 2003), it becomes necessary to quantify systematic uncertainties of 
the PS analysis. The covariance of the PS is of interest because it tells us the sample 
variance error of the measured PS and how much the modes on different scales are 
correlated. For GRFs, the variance of the PS (without binning and averaging) equals 
the PS itself. The non-Gaussianity of the cosmic density field will strongly affect the 
covariance of the one-dimensional mass PS on both large and small scales because the 
one-dimcnsional mass PS is an integral of the three-dimensional mass PS to the smallest 
scale possible. 

The covariance of the spatial-average mass PS can differ from that of the ensemble- 
average mass PS for at least two reasons. First, LOS's are no longer independent of each 
other. Correlations between the LOS's could increase the variance of the one-dimensional 
mass PS. Second, the length of each LOS is always less than the size of the universe, so 
that false correlations between two different modes are introduced in the covariance (for 
the three-dimensional case, see Feldman et al. 1994). 

3.1 Covariance of the One-dimensional Power Spectrum 

For the one-dimensional mass PS, the covariance is defined as 



where P{j,{k3) = {PUx±,h))^^ and Piotfcg) = (^10(^3)) = {PU^±,h)) from Chap- 



ter 2. The covariance of the mean PS of N LOS's can be expanded into a sum of pair-wise 



^m(fc3, k',) = {[PUh) - PmikMPiDiK) - PiB{k'^)]) 



(3.1) 



covariances between two LOS's separated by s-^ = x-^ — x^, e.g. 




(3.2) 
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where 



= {PU^^,h)PU^ci, k'^)) - PlBik3)PlBik's), 

and x-^ is the location of the jth LOS in X1-X2 plane. 

For GRFs, the four-point function [equation (2.10)] helps reduce the pair- wise covari- 
ance to 

1 00 

aUm,n',;s) = ^ (^(n^, n3)^*«, n3)^(n'_[, n^^^K, n'3)) 

nx ,n" ,n^'=-C!0 

^g2.^[(„x-nY)■xi+(„'_[-„'_[').xi]/B _ p^^^n^) p^j,{n'^) 

1 °° 2 

= ^2 ^(^^'"3)e^'^^"--/^| (^^3,„, =|e(s,n3)|^<^^3,„,, (3.3) 

nx=— 00 

where the subscript and the superscript are dropped for s. Fourier transforms of ^(s, k^) 
will give the three-dimensional mass PS and correlation function of the density field, 
and ^10(^3) = ^(0,^3) (sec also Vicl ct al. 2002). Because of isotropy, the pair-wise 
covariance cj^j-,(A;3, ^3; s) and ^(s, A;3) depend only on the magnitude of the separation, i.e. 
'7iD(^3,fc3;s) = al^{k^,k'^;s) and ^(8,^3) =^{s,k^). 

If one LOS is sampled from each GRF, the variance of the measured one-dimensional 
mass PS, P[Y,{k^), is (j\j^{k2,,k^) = (72d(A;3, ^3; 0) = P^^j-,(fc3), analogous to the three- 
dimensional case. li N = LOS's are sampled in each GRF on a regular grid as in 
Chapter 2, e.g. s = (1_l — 1^)6, the covariance becomes 

^ m— 1 

criD("3, "3) = T72 cTmina, 113, (1± - 1±)&] 



iV2 

lx,l',=0 



m-1 2 



J2 P{n^,ns)P{n'^,m)\J2 e^-^"^-"^)-!^/- 



nx,!!, =—00 lx=0 



5,^ 



"3 ,"3 



^ 00 

J2 Pin±,m)P{n^ + mj^,ns)6^^y^, (3.4) 
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nx,jx=-oo 



where I have used equation (2.19) to reach the last line. This result can be easily obtained 
using equations (2.10) and (2.18) as well. It is seen that the summation in the last line of 
equation (3.4) runs over only one mode out of every N modes in the Fourier space. If the 
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three-dimensional mass PS were constant, the variance of the mean PS of the N LOS's 
would be N times smaller than the variance of the PS of a single LOS. This is coincident 
with the theory of the variance of the mean. Since the cosmic density field is not a GRF 
in general, equation (3.4) is not expected to give an accurate estimate. 

If the N LOS's are sampled randomly in each GRF, the summation over LOS's in 
the second line of equation (3.4) should be re-cast to read 

_ oo N 

nx,n'j_=— oo j,'=l 

(3.5) 

Since is randomly distributed, with a large number of LOS's the second sum in 
equation (3.5) tends to vanish except for j = I. Thus, one obtains 

. oo ^ 

^m(n3, «3) ^ ^ E ^("^' "3)P(ni, m) 6^^^, = -P^ns) S^,^,^. (3.6) 

nx,nY=— oo 

Again, it shows that the variance of Pl^{ks) is inversely proportional to the number of 
LOS's, but this is valid only for a large number of LOS's randomly sampled in a GRF. 

Through the Gaussian case one can find the lowest bound of the uncertainty for 
a measured one-dimensional mass PS and the minimum number of LOS's needed for 
a target precision. For example, to measure the one-dimensional mass PS of GRFs 
accurate to 5% on every mode, one needs at least 400 LOS's. However, the cosmic 
density field has a far more complex covariance due to its non-vanishing trispectrum, 
which can significantly increase the variance of the measured one-dimensional mass PS. 
The trispectrum introduces an extra term to the covariance of the one-dimensional mass 
PS in addition to the Gaussian piece [equation (3.4)], i.e. 

1 °° 

(^?D(^i3,n'3) = -^ E ^(n±,n3)P(nx + mjx,n3),5^3„, +riD(n3,n'3), (3.7) 

n±,jx=-oo 

where 

1 °° 

Tm{n^,n'^) = ^ E T(n, -n - mj, n', -n' - mj'), (3.8) 

n±, j±, n'j^, and j'j^ are the transverse components of n, j, n', and j', respectively, and 
J3 = is = 0- III deriving equation (3.7), I have made use of equations (2.10) and (2.18). 
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3.2 Line-of- Sight Length 



Observationally, the length of a LOS is always much less than the size of the observable 
universe. In this case, the one-dimensional mass PS and its covariance must be re- 
formulated. 

For a LOS from (xx,0) to {x.±,L), its Fourier transform is 



x±,n3)= / 5(xx,X3)e-2™3/-^dx3 
Jo 



T ^ , p2-Ki{nzL/B~hz) _ -1 

^ 5 „n)e— /-i-^^;^^^. (3.) 



so that 

(5L(xx,n3)5£(x^,n'3)) = -^ (-lf3-"3p(nHn3,n3)i/;(n3,n'3), (3.10) 



n=— oo 



where 

/ ~x sin[{n2.L/B - ra3)7r] 

w{nz,nz) = — — ^— . (3.11) 

iT(n2,L/B - ns) 

The factor w{ns, 713) acts like a window function that mixes Fourier modes of the density 
field along direction into the one-dimensional Fourier mode at h^. Because u;(n3,n3) 
and w{ns,'h'^) are not orthogonal to each other, the term {SL{'x.±,hs)Sl^{yi±,h'^)) is no 
longer diagonal with respect to and n'^. However, it remains diagonal-dominant. 
One may define an observed one-dimensional mass PS at x^ as 

Pm{^±,n3) = (|^L(x±,n3)P)/L. (3.12) 

The ensemble- aver aged one-dimensional mass PS is 

L °° 

^iD(n3) = (^'iD(x±,n3)) = ^ P{n)w\n3,h3). (3.13) 

n=— 00 

The covariance of Pid("'3) for GRFs is 

^m("3,?^3) = ([^iD(x±,n3) - PiD(ri3)][AD(x_L,n3) - -PiD(n3)]) 

As expected, the covariance is not diagonal because of the window function u) (713, 77.3). 



^2 °o 

XI ^(n)-P(nO«^('^3,?^3)w'(f^3,n3)u;(n3,n3)w;(n3,n3). (3.14) 

n,n'=— 00 



CHAPTER 4 
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COVARIANCE OF THE ONE-DIMENSIONAL POWER 
SPECTRUM: THE COSMIC DENSITY FIELD 

Even at moderately high rcdshift, the cosmic density field is already quite non-Gaussian 
on scales below 10 h'^Mpc (Feng k Fang 2000; Zhan et al. 2001; Zhan & Fang 2002). 
When projected in one dimension, i.e. equation (2.1), the small-scale non-Gaussianity 
will obviously affect the measured one-dimensional mass PS on much larger scales. A non- 
vanishing trispectrum arises because of the non-Gaussianity, and it introduces an extra 
term, Tij^{ks,k'^), to the covariance of the one-dimensional mass PS [see equation (3.7)]. 
Although it is possible to derive the trispectrum based on the halo model (e.g. Cooray 
& Hu 2001), the one-dimensional projection, unfortunately, obscures the contribution of 
the trispectrum to the one-dimensional mass PS. Therefore, numerical simulations are 
necessary for the study. 

4.1 Simulations 

Three A'^-body simulations of 256^ cold dark matter (CDM) particles are used to quan- 
tify the covariance of the one-dimensional mass PS. The model parameters are largely 
consistent with WMAP results (Spergel et al. 2003), e.g. (J^, J^b, ^^A, as, n) = (0.27, 
0.04, 0.73, 0.71, 0.85, 1), where O is the cosmic matter density parameter, Ob the baryon 
density parameter, J^a the energy density parameter associated with the cosmological 
constant, the rms density fluctuation within a radius of 8 /i~^Mpc, and n is the power 
spectral index. The box sizes of the simulations are 128 /i~^Mpc (labeled as B128), 256 
/i"^Mpc (B256), and 512 h'^Mpc (B512). The baryon density 9.^ is used only for the 
purpose of calculating the transfer function using LINGER (Ma &; Bertschinger 1995), 
which is then read by GRAFIC2 (Bertschinger 2001) to generate the initial condition. The 
CDM particles are evolved from z = 44.5 to present using GADGET (Springel, Yoshida &; 
White 2001). 

The simulation produces snapshots at 2; = 3 and 0. For each snapshot, the particles 
are assigned to a density grid of 512^ nodes using the triangular-shaped-cloud (TSC) 
scheme (Hockney &: Eastwood 1981). LOS's are then sampled along the rcs-axis. The 
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particle Nyquist wavenumber {kp = 27r, 7r,7r/2 h Mpc"-*^) sets a nominal cut-ofF of the 
wavenumber beyond which the Fourier modes of the density field cannot be represented 
by discrete particles. Of course, since the simulation code retains higher-wavenumber 
perturbations to the particles, the actual cut-off can be somewhat higher. Nevertheless, 
the Nyquist wavenumber of the density grid {2kp) should be sufficient to accurately 
recover most of the Fourier modes contained in the particle distribution. 

Because the density grid has a finite resolution, both the coordinates and the wavenum- 
bers are discrete and finite. The equations in Chapters 2-3 need to be modified accord- 
ingly, so that they do not sum over non-existing modes. For example, equation (2.15) 
becomes 

^ M/2 

PiD{ns) = ^ J2 P{^±,ns), (4.1) 

nx=-M/2 

where M = 512 is the number of nodes of the density grid in transverse directions. 

For brevity and the purpose of comparing covariances, I introduce the following nor- 
malized covariances: 

C{k3,k's;s) = al^{k-i,k's;s)[Pij,{ks)Pij,{k'3)]-\ (4.2) 

which is the pair- wise covariance between two LOS's separated transversely by a distance 
s and scaled by the one-dimensional mass PS; 

C(fe3,fc3) = c7l^iks,k'^)[Pmiks)P,nik'^)]-\ (4.3) 

which is the covariance of the estimated one-dimensional mass PS, e.g. equation (2.18), 
scaled by the one-dimensional mass PS; and 

Ciks,k's) = Ciks,k'^)[C{k3,ks)C{k':„k'^)]-'/\ (4.4) 

For GRFs, all these covariances are diagonal in matrix representation, if the length of 
LOS's is the same as the size of the simulation box. In addition, C{ks,k3;0) = 1 and 
C{k3, ks) = N~^, where N is the number of LOS's that are sampled for estimating the 
one-dimensional mass PS. The advantage of C{ks, k'^) is that (7(^3, k^) = 1 for all fields, 
so that they can be compared with each other in a single (grey) scale. 



29 



s = 3/j-iMpc 















if : ■: : J I : i 










03 



1 2 3 4 1 2 3 4 1 2 3. 4 

if 3 {tt/IS ft Mpc-*) 

Figure 4.1: Normalized pair-wise covariances C{k3, k'^; s) [see equation (4.2)] in grey scale. 
The covariances are calculated over 2000 pairs of LOS's sampled at a fixed separation s 
from the simulation B512 at z = 3 (lower panels) and from 2000 GRFs that have the same 
three-dimensional mass PS as the simulation (upper panels). The grey scale extends to 
—0.1, which corresponds to white. 



4.2 Pair- Wise Covariance 

The normalized pair- wise covariance C{ks,k'^;s) is shown in Fig. 4.1 for GRFs and the 
simulation B512 at 2; = 3. The covariances for the GRFs are averaged over an ensemble 
of 2000 random realizations, while those for the simulation are averaged over 2000 pairs 
of LOS's from a single field. The behavior of the covariances is consistent with the 
expectation. Namely, 0(^3,^:3; 0) ~ S^^ ^, and C{k3,ks;s) decreases as the separation s 
increases. The simulation does deviate from GRFs because of the non-Gaussianity, which 
increases the variance C{ks, k3; s). Fig. 4.2 compares C{k3, ka; s) with that from equation 
(3.3). I note in passing that the expected values of C{k3, ks; 8 h~^Mpc) are so close to 
that they are even below the rms value of the off-diagonal elements of C(A;3, k'^; 8 /i~^Mpc) 
for 2000 GRFs. Hence, it is practically difficult to recover three-dimensional statistics 
from criD{k3, ks; s) or ^(s,/c3) if the LOS's are too far apart. In theory, the modes of 
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Figure 4.2: Diagonal elements of the covariance matrix (7(^3, k'^;s). The horizontal solid 
line in the left panel marks the rms value of the off-diagonal elements in the s = 8 ^~^Mpc 

case. Other lines are from direct summations of the three-dimensional mass PS using 
equation (3.3). The symbols are measured from GRFs (left) and the simulation B512 at 
z = 3 (right). For GRFs, C(A;3,A:3;0) is expected to be unity. 



two LOS's are always correlated as A; — >■ 0, regardless of their separation. However, 
the correlation for s > 8 h~^Mpc is so weak that it will not be easily detected against 
statistical uncertainties. Thus, Figs. 4.1 and 4.2 suggest that LOS's sampled in a single 
cosmic density field are practically independent of each other as long as s > 8 h~^Mpc. 

4.3 Covariance 

Fig. 4.3 shows the one-dimensional mass PS measured by averaging over 64 and 1024 
LOS's from the B128 simulation. Although the mean PS of all the 512^ LOS's agrees 
with the result of a direct summation of the three-dimensional mass PS using equation 
(4.1), the deviation of the PS for any particular group of 64 or 1024 LOS's is substantial, 
especially at 2; = 0. The variance is smaller at z = 3 than at 2; = because the cosmic 
density field is more Gaussian earlier on. It is roughly inversely proportional to the 
number of LOS's. This can seen better by comparing the lower right panels of Figs. 4.4 
and 4.5. However, even at 2; = 3 the variance of the one-dimensional mass PS is still much 
higher than A^~^P^j-,(fe3), i.e. the variance for GRFs, which indicates a heavy contribution 
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Figure 4.3: Estimated one-dimensional mass PS's P[jy{ks). The PS's are measured by 
averaging over 64 (light grey) and 1024 (dark grey) LOS's that are drawn from the B128 
simulation. Shaded areas mark la dispersion of the PS among 2000 (light grey) and 256 
(dark grey, as 256 x 1024 exhausts all the 512^ LOS's) distinct drawings. Circles axe the 
mean PS's for dark grey areas, i.e. the mean of 512^ LOS's. Sohd hues are results of a 
direct summation of the three-dimensional mass PS using equation (4.1). The LOS's are 
sampled on a grid with fixed spacing in the left panels but drawn randomly in the right 
panels. 
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from the trispectrum. The formulae in Chapters 2 and 3 often assume that LOS's are 
sampled on a grid with fixed spacing, which may not be applicable to realistic data such as 
inverted densities from the Lja forest (Nusser & Haehnelt 1999; Zhan 2003). Therefore, 
I sample the LOS's in two ways in Fig. 4.3: grid sampling and random sampling. Since 
no significant difference is observed, random sampling can be safely applied in the rest 
of this dissertation. 

The normalized covariances C{ks,k'^) and C{k^,k'^) are quantified in Figs. 4.4 [N = 
64) and 4.5 {N = 1024). The left column in each figure is the covariances of the spatially 
averaged one-dimensional mass PS from a single GRF that has the same box size and 
three-dimensional mass PS as its corresponding simulation. Clearly, the covariances 
based on spatial average are nearly diagonal with unity diagonal elements. This is in 
agreement with the expectations based on ensemble average for GRFs and is consistent 
with the ergodicity argument. The middle column is similar to the left column except 
that the density fields are from simulations at z = 3. The modes in the simulated density 
fields are strongly correlated, so that the covariances are no longer diagonal. In other 
words, the trispectrum is non-vanishing for the cosmic density field, as it is the only 
term that contributes to off-diagonal elements in the covariance. The cosmic density 
field becomes so non-Gaussian at 2; = that grey scale figures of the covariances will 
not be readable. Hence, I only plot four cross sections perpendicular the diagonal with 
Q = (^3 + A;3)/(7r/4 h Mpc~^) for B128 and B256 in the right column. The dominance of 
the diagonals suggested by these cross sections is actually weaker than that in the middle 
column, which can be seen by contrasting the cross sections for B128 at 2; = in Fig. 4.4 
with that for B128 at z = 3 in Fig. 4.6. 

The non-Gaussianity is reflected not only in the correlations between different modes 
but also in the variance of the one-dimensional mass PS, i.e. the diagonal elements of 
the normalized covariance C{k3, k'^). The lower right panels of Figs. 4.4 and 4.5 compare 
C{k^, k^) for five different density fields. The variance from the simulation B128 is orders 
of magnitude higher than N~^P^jy[k3), and it grows as the non-Gaussianity becomes 
stronger toward z = 0. As a result, the sample variance error estimated for GRFs is 
much lower than what one can actually measure from the cosmic density field. According 
to equation (3.7), both aliasing and the trispectrum contribute to the variance. Since 



33 




e & i^- I? 




.1 I'l l 11 I I ttljlVI I'I'I'I I I I 




4 8 12 16 2 4 6 8 
B256 z = 3 B2Se z = 



J I M I I I I 1 .1.1 |- 1. 1. 1- 1 I I I I I I q 
— (J=£ J 




# ft P ^ 

B5t2 z a 



12 3 4 
if,- if;, (ttA /I Mpc-1) 




-is I 



M I r I I I M I I I I I J I I I 

B12S^=0 



^ - IS ^ * * 



1 

0.7 

tP 

1; 



12 3 4 
(Tr/4 h Mpc^') 



t 



4 



Figure 4.4: Normalized covariances C{ks,k'^) [see equation (4.4)] averaged over 2000 
groups, each of which consists of 64 LOS's (N = 64). For each panel in the left, the LOS's 
are randomly drawn from a single GRF that has a box size of 128 h~^Mpc (R128), or 256 
h-^Mpc (R256), or 512 h'^Mpc (R512). The GRFs have the same three-dimensional 
mass PS as their corresponding simulations at z = 0, but note that C{k^,k'^) is inde- 
pendent of redshift for GRFs. Similarly, the middle column is for simulations at 2; = 3. 
The covariances are shown in a linear grey scale with black being 1.2 and white less 
than or equal to 0. At z = 0, the normalized covariances become much less diagonally 
dominant than the B128 z = 3 panel in the same grey scale, so only 4 cross sections 
along Q = {k^ + k!^)/{TT/4: h Mpc~^) are plotted for B128 and B256. Diagonal elements 
C{k3,k3) [see equation (4.3)] of R128 (squares), B128 at z = (circles), B128 at 2 = 3 
(diamonds), R512 (crosses), and B512 at z = 3 (pentagons) axe shown in the lower right 
panel along with a thin solid line marking the value 1/64, i.e. the expectation for GRFs. 
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Figure 4.5: The same as Fig. 4.4, except that the normahzed covarianccs C{h^,k'^) are 
averaged over 256 groups, each of which consists of 1024 LOS's (iV = 1024). The thin 
solid line in the lower right panel marks the value of 1/1024. 
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the GRFs have the same three-dimensional mass PS and are sampled in the same way 
as the simulations, their near-Gaussian variances of spatially averaged one-dimensional 
mass PS's suggest that the contribution of the aliasing effect is negligible. Comparisons 
of C{ks,k3) for the same density field but with different sizes of sample {N = 64 and 
1024) confirm the observation in Fig. 4.3 that the variance of the one-dimensional mass 
PS scales roughly as N"^. This means even though the non-Gaussianity drives up the 
sample variance error of the measured one-dimensional mass PS, one can still reduce the 
error by sampling a large number of LOS's. 

The distribution function of line-of-sight column densities (upper left panel of Fig. 4.6) 
suggests that the long tail of high-eolumn-density LOS's may affect the covariance. I re- 
calculate the covariance C{k^,k'^) with a selection criterion that the column density of 
each LOS Pcol/Pcol < 3. The result is shown in the upper middle panel of Fig. 4.6. The 
diagonal elements are more dominant than those in the B128 z = 3 panel in Fig. 4.4. The 
first two lower panels are the cross sections of C{k-^, fcg) with (middle) or without (left) the 
selection criterion. They provide a more quantitative comparison, which suggests that 
rare high-column-density LOS's do increase the correlations between different modes of 
fluctuations. Meanwhile, Fig. 4.7 clearly demonstrates that these LOS's also increase the 
variance of the one-dimensional mass PS by at least a factor of 2 on all scales. 

For a fixed number of particles, the simulation box sets a cut-off scale, below which 
the fluctuations cannot be represented. In other words, the number of particles and 
the size of the simulation determine the highest-wavenumber modes that are included in 
calculations of the one-dimensional mass PS and its covariance. Since the non-Gaussianity 
is stronger at smaller scales, a larger simulation box cuts off more small-scale fluctuations 
and may cause the correlation to appear weaker in Figs. 4.4 and 4.5. To test this, I assign 
the density field of the simulation B128 at z = 3 on a grid of 128^ nodes. The spatial 
resolution is the same as the simulation B512 on a grid of 512^ nodes. The covariance 
C{k3,k'^) is calculated in the same way as those in Fig. 4.4 but with fewer groups of 
LOS's. Each group still has 64 LOS's. The results are shown in the right column of 
Fig. 4.6 and in Fig. 4.7. Evidently, there is a significant reduction of the correlations 
between different modes as well as the variance of the one-dimensional mass PS. Indeed, 
the variance from the low-resolution calculation is close to that from the B512 simulation. 
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Figure 4.6: The effect of non-Gaussianity and resolution on the covariance. The left 
column shows the distribution function of column density Pcoi/Pcoi (upper panel) and 
cross sections of the normalized covariance (7(^3, k'^) along Q = (/ca + k'^)/ {tt/4 h Mpc~^) 
(lower panel) for B128 at z = 3. The middle column shows C{ks, fcg) (upper panel) and 
its cross sections (lower panel) for the same simulation output but with an exclusion of 
318 LOS's that have Pcoi/Pcoi > 3. The normalized covariances in the first two columns 
are calculated in the same way as in Fig. 4.4, i.e. with 2000 groups and N = 64. Thus, 
about half of the 318 LOS's would be selected without the criterion Pcoi/Pcoi < 3. The 
right column is similar to the middle column, but the density is assigned on a grid of 
128^ nodes. All the 128^ LOS's are selected and divided into 256 groups with N = 64. 
The grey scale is the same as Fig. 4.4. 
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Figure 4.7: The effect of non-Gaussianity and resolution on the variance, i.e. diagonal 

elements of the covariance a"^j-,(A;3, /cg). Diamonds are the normalized variance C{ks,k'^) 
for B128 at z = 3. Circles correspond to the middle column of Fig. 4.6, which imposes 
the selection criterion Pcoi/Pcoi < 3. Squares are from the low-resolution calculation in 
the right panel of Fig. 4.6. The horizontal line marks the Gaussian value of 1/64. 




Thus, the apparent resemblance between B512 and R512 in Figs. 4.4 and 4.5 is mostly 
due to the low resolution of the large-box simulation. 

It is expected from equation (3.14) that the covariance matrix will not be diagonal 
if the length of LOS's is less than the size of the simulation box. Fig. 4.8 shows the 
covariances that are calculated in the same way as those in Fig. 4.4 for the GRF R512 
and the simulation B512, except that each LOS is only 128 h~^Mpc long. The effect of 
the length is not visible for the GRF, but it does increase the correlation between different 
modes and doubles the variance of the one-dimensional mass PS for the simulation B512. 
For real observations, the line-of-sight length is always much less than the size of the 
observable universe, so that the window function in the line-of-sight direction will cause 
stronger mixing of modes and more pronounced increase of correlation and variance. 



CHAPTER 5 
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COVARIANCE OF THE ONE-DIMENSIONAL POWER 
SPECTRUM: THE LYa FOREST 

Previous chapters have focused on the mass PS and its covariance. However, one does 
not directly observe one-dimensional density fields. What can be observed instead is 
the Lya flux. Therefore, many works have been based on flux statistics, which are then 
used along with simulations to constrain cosmology. Here I examine the flux PS and its 
covariance using simulated Lya forests. 

5.1 Simulated Lya Forests 

The Lya forest probes deeply into the nonlinear regime of the cosmic density field. This 
has made numerical simulations an indispensable tool for understanding the nature of 
the Lya forest and inferring cosmological parameters from flux statistics. Two types of 
cosmological simulations have been commonly used to simulate the Lya forest. One is 
pure CDM simulations (A^-body simulations) that assume baryons to trace dark matter 
(e.g. Petitjean, Miicket & Kates 1995; Riediger ct al. 1998). The other is hydrodynamical 
simulations (e.g. Cen et al. 1994; Zhang et al. 1995; Hernquist ct al. 1996). Other types 
of simulations exist as well. For example, the simple log-normal model (Bi & Davidsen 
1997) is already able to reproduce some Lya flux statistics. 

I use both A''-body simulations and hydrodynamical simulations to investigate the flux 
PS and its covariance. Readers are referred to the above references for details of simula- 
tion techniques, and I describe the procedure of extracting Lya forests from simulations 
in this section. 

5.1.1 Hydrodynamical Simulations with Photoionization 

Two hydrodynamical simulations (HYDRO 1 & HYDR02) were provided by Romeel 
Dave. They are both variants of the low-density-and-flat CDM (LCDM) model with 
a slight tilt of the power spectral index n (see Table 5.1). HYDROl evolves 128^ CDM 
particles and 128^ gas particles from 2; = 49 to using Parallel TreeSPH (Dave, Dubinski 
&: Hernquist 1997). HYDR02 differs from HYDROl only in cosmological parameters. 
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Table 5.1: Parameters of the simulations. 



Model 



Type 



n n^"" Qa h 



n 



HYDRO 1 


Hydro. 


0.4 


0.05 


0.6 


0.65 


0.95 


0.8 


HYDR02 


Hydro. 


0.3 


0.04 


0.7 


0.7 


0.95 


0.8 


HIGHn 


AT-Body 


0.3 


0.04 


0.7 


0.7 


1.1 


0.8 


HIGHo- 


AT-Body 


0.3 


0.04 


0.7 


0.7 


1.0 


1.0 


LCDM 


iV-Body 


0.3 


0.04 


0.7 


0.7 


1.0 


0.8 


OCDM 


iV-Body 


0.3 


0.04 





0.7 


1.0 


0.8 



" With exceptions of HYDRO 1 k HYDR02, the baryon 
density parameter is used only for generating the initial 
mass power spectrum. 



and it has snapshots available down to 2; = 2. The box size is 22.222 h Mpc in each di- 
mension with a 5 h~^kpc resolution. The two simulations also include star formation with 
feedback and photoionization (Katz, Weinberg & Hernquist 1996). The UV ionization 
background is from Haardt & Madau (1996). 

5.1.1.1 Density Grid 

Snapshots of the simulations contain the position r-j and velocity Vj of each particle, 
where i is the label of the ith particle. Smooth-particle hydrodynamics (SPH) defines 
the baryon density Pb(x) location to be a sum of contributions from all nearby 

gas particles, i.e. 



where A^p is the total number of particles, w is the density kernel or the assignment 
function, nii is the mass of particle i, and ej is the smoothing length determined by the 
distance between particle i and its jth neighbor (j = 32 in this chapter). In practice, 
densities are assigned on a discrete grid for further analysis. 

I adopt a spherically symmetric spline kernel from Monaghan &: Lattanzio (1985, with 
a typo corrected), which is also used in TreeSPH for force calculations. It has the form 




(5.1) 




< r < e 



r > 2e 



€<r <2e 



(5.2) 
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Figure 5.1: Fourier transform of the assignment function w{r, e). 



which vanishes beyond the radius 2e and has a smooth gradient everywhere. The Fourier 
transform of the kernel is 



w{k, e) = w{ke) 



48 



(4 — ke sin ke) sin^ — — sin^ ke 



(5.3) 



Fig. 5.1 shows that the density kernel is an effective low-pass filter that suppresses fluctu- 
ations on scales smaller than 2e{k> Tr/e). This filtering is necessary because fluctuations 
on scales less than the physical size of an SPH particle are unphysical. Furthermore, to 
reduce the alias effect (see Chapter 2), the smoothing length e is required to be greater 
than or equal to the spacing of the density grid. Other kernels such as the TSC and 
wavelet scaling functions have also been used for similar purpose. 

It is important to realize that the filtering scale should be adjusted to particle con- 
centration — the denser the environment, the smaller the filtering scale. A particle in an 
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empty region does not represent a condensed clump of matter sitting in vacuum but rather 
a dilute distribution that fills the space between the particle and its distant neighbors. 
Thus, particles in empty regions should not contribute to small-scale fluctuations. On the 
other hand, particles in high density regions contain more small-scale information, and 
their filtering scale should be smaller, i.e. a higher cutoff wavenumber. Density kernels 
based on neighbor distances satisfy such requirement, and they are widely used in SPH 
simulations. When the scale of interest is larger than or comparable to the mean inter- 
particle distance {k < kp, see Chapter 4), kernels with an indiscriminating filtering scale 
for all particles are just as good. Otherwise, a density-dependent kernel must be used. 

The density field pb (x) can be readily constructed once the smoothing length is deter- 
mined for each particle. In principle, LOS's may be sampled in any random direction, but 
for computational simplicity I assign the density on a grid of 256'^ nodes, and then extract 
one-dimensional densities randomly from this grid. Meanwhile, particle temperatures are 
also assigned to each node with weights proportional to each particle's contribution of 
density at that node. 

5.1.1.2 Lya Flux 

I assume a universal hydrogen fraction of 0.76 to convert baryon densities to hydro- 
gen densities. For each density node, the equilibrium Hi fraction is calculated from 
the balance between cooling and heating, which include adiabatic cooling (cosmic ex- 
pansion), photoionization, recombination, collisional excitation and ionization, thermal 
Bremsstrahlung, and Compton scattering with cosmic microwave background (CMB) 
photons (for details, see Katz et al. 1996). R. Dave provided codes for this calculation. 
The assumption of ionization equilibrium certainly breaks down in very dynamic regions 
such as shocks. However, since the equilibrium H I fraction calculated for shocks is already 
considerably lower than that elsewhere, there will not be much an effect on simulated 
Lya forests even if additional shock physics can further reduce the H I fraction by orders 
of magnitude. Besides, shock fronts, unlike shocked gases, occupy only a small fraction 
of the total simulation volume, so they could not have a great impact on the Lya forest. 

With the Hi fraction and hydrogen density along the LOS, one can determine the 
Lya optical depth r and transmitted Lya flux F of each pixel, i.e. each node of the 
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density grid. The mean flux F of all LOS's is well determined by observations. I adjust 
the intensity of the UV ionization background Fuv so that the mean flux of all pixels in 
the simulations follows 



which is adapted from Kim et al. (2002) and Dave et al. (1999). This mean flux formula 
is also consistent with other observations (Lu et al. 1996; Ranch et al. 1997; McDonald 
et al. 2000). There is a shght inconsistency that HYDRO 1 and HYDR02 have already 
included the UV ionization background, yet I need to adjust the intensity of the UV 
radiation on outputs of the simulations to fit mean fluxes. This inconsistency does not 
significantly affect the results that follow because, first, the simulation outputs are able 
to reproduce the observed mean flux with their internal UV ionization background (Dave 
et al. 1999). External adjustments are only needed to vary the mean flux within the 
given observational and numerical uncertainties. Second, the UV ionization background 
has an important role in the evolution of the IGM temperature (Katz et al. 1996), but 
the large-scale distribution of baryons is driven by gravity. Thus, even if the intensities 
of the externally adjusted UV background were used internally in the simulations, LOS 
Lya absorptions should not change greatly. 

The mean temperature of the IGM is on the order of 10"^ K, so thermal broadening 
of absorptions must be taken into account. At a temperature Tj, the flux decrement 
Di = 1 — Fi of pixel i is spread to pixel j as 



where 6 2± 13 km s , H is the Hubble constant at corresponding redshift, d is the phys- 
ical separation mapped by two adjacent pixels, and T4^i = rj/(10^ K). The ionization- 
equilibrium temperature can be uniquely determined from density, and it is obtained for 
each pixel while Hi fraction is calculated recursively. Because of shock heating, especially 
at low redshift, ionization-equilibrium temperatures are often lower than density- weighted 
SPH temperatures of the density grid. In this case, the latter is used for thermal broad- 




0<z< 1.5, 



1.5 < z < 4 



(5.4) 




Rl/2 



(5.5) 
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ening. The broadened flux F is then 

M 

Fi = l-J2Dij, (5.6) 

where M is the number of pixels along the LOS, but practically the summation is over a 
much smaller number of pixels where Dij is significant. 

Thermal broadening smoothes out small-scale fluctuations in the Lya forest without 
altering the mean flux. Therefore, it preferentially reduces the flux power on small scales. 
At z = 3, the majority of Lya absorptions arise from regions of density p/p ^ 10, where 
temperatures are not much above 10^ K and broadening widths b^/T4 are a few tens 
of km s~^. At z = 0, however, Lya absorptions are produced in much hotter regions 
where broadening widths can be as high as hundreds of km s~^. In other words, thermal 
broadening is a much stronger effect at lower redshift, which can be seen by comparing 
simulated Lya forests at z = (Fig. 5.2) with those at z = 3 (Fig. 5.3). 

There is a general correlation between the temperature and Lya absorption due to the 
temperature-baryon density-Hi fraction relation (see Section 5.1.2.2). However, high- 
temperature regions (T > 10^ K) at z = are not always related to Lya absorptions 
because they are often too hot and too dilute to maintain an appreciable Hi column 
density. These high-temperature regions are categorized as the shock-heated 10^-10^ K 
WHIM (Dave et al. 1999; Dave & Tripp 2001; Dave et al. 2001). 

The cosmic density field becomes more and more clustered so that for any random 
LOS the chance of passing through relatively high density regions that can produce 
detectable Lya absorptions is much smaller at z = 0. This is reflected in Figs. 5.2 
and 5.3 that low-redshift Lya forests have much fewer absorptions per unit comoving 
distance than high-redshift ones. It is also interesting to note that there is a relatively 
deep Lya absorption in the last panel of Fig. 5.2 despite the high mean flux of 0.97 
at z = 0. The LOS in this panel is the same as that in the second and fourth panels 
of Fig. 5.5, where LOS baryon and dark matter densities are shown in real space and 
redshift space, respectively. The deep absorption actually arises from a nearly virialized 
cluster of real-space density p/p ~ 160. 
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Figure 5.2: Thermal broadening of Lya absorptions at z = 0. Solid lines are Lya absorp- 
tions with thermal broadening, and dotted lines without. Grey lines are temperatures 
along LOS's. The Lya forests are generated from baryon distributions in HYDROl with 
the assumption of ionization equilibrium. 
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Figure 5.3: Thermal broadening of Lya absorptions at z = 3. Legends are the same as 
in Fig. 5.2. 
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5.1.1.3 Line of Sight in Redshift Space 

So far I have not mentioned the fact that the Lya forest is observed in redshift space 
where the distribution of matter is distorted by peculiar velocities. A simple way to 
approximate redshift distortion is to displace each particle a distance v^/H along the 
LOS before constructing the density grid, i.e. 

rf = + ^X3, (5.7) 

where the superscript S stands for redshift space, and I have made use of the plan- 
parallel approximation and assumed X3 to be in the LOS direction. Subsequently, the 
redshift-space density p^{'X-) is 

p^(x) =^miu;(|x-rf|,ef), (5.8) 
j=i 

where the smoothing length of each particle is also obtained in redshift space. Then, Lya 
forests can be extracted from (x) in the same way as discussed above. A more detailed 
discussion on the effect of redshift distortion is given in Section 5.1.2.1. 

5.1.2 Pseudo- Hydro Technique 

Although full hydrodynamical simulations are well suited for studies of the Lya forest, 
they are currently too time-consuming to explore a large cosmological parameter space 
as one often desires. Whereas, iV-body simulations run much faster, and they can be 
used to cover a wide range of cosmological models in practical time. 

Croft et al. (1998) proposed a pseudo-hydro technique for generating Lya forests 
from A^-body simulations. It is based on two important theoretical expectations that 
are supported by hydrodynamical simulations: (1) In general, baryons trace dark matter 
above the Jeans scale (e.g. Gnedin &: Hui 1998); and (2) In ionization equilibrium, the 
equation of state (EOS) of the IGM gives rise to an approximate temperature-density 
relation 

r = ro(pb/Pb)", (5.9) 

where Tq ~ 10^ K, 0.3 < a < 0.6, and pb/Pb < 10 (Hui & Gnedin 1997). Since the Lya 
optical depth is proportional to T"^-"^ in regions around the mean density, one finds 

F ~ e~^^'^/^^'^ ~ g-^(Pd/pd)^^ (5.10) 
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Figure 5.4: The correlation between baryon and dark matter densities. Grey dots are 
for z = 3, and black dots z = 0. Each group consists of 4000 pairs of density values 
randomly selected from the density grid of the simulation HYDRO 1. The diagonal line 
follows Pb/Pb = Pd/Pd- 



where A oc ri^FyyTQ 7 = 2 — 0.7a, and pd is the dark matter density. Usually the 
constant A is left as a fitting parameter adjusted to reproduce the observed mean flux. 

5.1.2.1 Correlation between Baryons and Dark Matter 

To justify the pseudo- hydro technique, one must show that baryons and dark matter trace 
each other on large scales. For a simple test. Fig. 5.4 compares baryon densities with dark 
matter densities from the same set of 4000 randomly selected density nodes of HYDRO 1. 
There is clearly a strong correlation between baryons and dark matter (see also Gnedin & 
Hui 1998). The correlation has larger scatter at lower redshift because gravity is no longer 
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the dominant driving force behind strong hydrodynamical events such as shocks, which 
occur more frequently at lower redshift. Baryons are slightly denser than dark matter 
below the mean density at 2; = because of finite pressure of SPH particles (Gnedin k. 
Hui 1998). In other words, SPH particles have much larger smoothing radii than CDM 
particles, so that SPH particles in a low-density region receive less acceleration toward a 
nearby high-density region than CDM particles. 

The Jeans length sets a characteristic scale, below which baryon pressure will resist 
the growth of gravitational perturbations and cause baryons not to trace dark matter. 
By a simple comparison between the dynamical time and sound travel time, one finds 
the comoving Jeans length 



= (1 + z) 



Gp y 9i^mpH^n p{l + z) 
= 780 tI^'^ [n {p/p){l + z)]~^/^ /i-^kpc, (5.11) 

where Cg is the speed of sound, G is the gravitational constant, A;b is the Boltzmann 
constant, p 2± 0.59 is the mean molecular weight, mp is the proton mass, and Hq = 100 h 
km s~^ Mpc~^. With the EOS equation (5.9), the Jeans length can be rewritten as 



1/2 

Lj = 780 



1 



1/2 /-N (l-a)/2 

' ^ /i-^kpc. (5.12) 



104 Ky [0(l + z)_ 

For To = 10^ K and O = 0.4 (HYDROl), it is expected that baryons in regions around 
the mean density to generally follow dark matter above 1.2 h~^Mpc (0.6 h~^Mpc) at 
z = {z = 3). Of course, in vast low density regions the Jeans length may be much 
larger. 

The Jeans length analysis idealizes the state of baryons and neglects external forces. 
Thus, it is not applicable in very dynamic regions such as shock fronts. For example, in 
the spherical collapse case, even though baryons may initially follow dark matter, shocks 
could eventually develop and allow baryons to separate from dark matter. 

Figs. 5.5 and 5.6 examine baryon and dark matter densities along LOS's in both real 
space and redshift space. The two LOS's in Fig. 5.5 (Fig. 5.6) correspond to the LOS's 
in the second and last panels of Fig. 5.2 (Fig. 5.3). Baryon densities and dark matter 
densities have almost one-to-one correspondence in real space at 2; = 3. However, they 
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Figure 5.5: LOS baryon densities (dashed liens) and dark matter densities (solid lines) 
at z = 0. The upper two panels show LOS's in real space, while the lower two the same 
LOS's in redshift space. All densities are extracted from HYDROl. The two LOS's here 
correspond to those in the second and last panels of Fig. 5.2. 



51 




5 10 15 20 

Comoving Distance (ia~^Mpc) 



Figure 5.6: The same as Fig. 5.5 but at z = 3. The two LOS's here correspond to those 
in the second and last panels of Fig. 5.3. 
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Figure 5.7: Three-dimensional and one-dimensional mass PS's of baryons (dashed lines) 
and dark matter (solid lines) from HYDROl. Redshift-space PS's are shown in thick 
lines, and real-space PS's thin lines. 



do not share the same velocity structure because SPH particles receive not only gravita- 
tional accelerations but also hydrodynamical accelerations. This leads to the departure 
of baryons from dark matter in redshift space. At z = 0, the difference between baryon 
and dark matter distributions is already seen in real space, and it does not seem to be 
amplified by redshift distortion. 

The LOS densities of baryons and dark matter are qualitatively consistent with the 
expectation from Jeans length analysis, and statistics axe needed to quantify how well 
baryons and dark matter trace each other. Fig. 5.7 compares baryon and dark matter 
PS's in both real space and redshift space. At z = 3, the three-dimensional baryon PS has 
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already departed from the three-dimensional dark matter PS on scales below 6 Mpc 
{k >1 h Mpc""-*^), which is ten times larger than the Jeans length in mean-density regions. 
This discrepancy can be attributed to the fact that the vast volume of the universe is 
well below the mean density and have much larger Jeans lengths. Therefore, the three- 
dimensional baryon PS is somewhat lower than that of dark matter for A; > 1 /i Mpc"^, 
even though the Jeans length in mean-density regions corresponds to fcj 2± 10 /i Mpc~^. 
At z = 0, shocks have heated parts of the IGM to much higher temperatures and led to 
more reduction of the baryon PS with respect to the dark matter PS. 
The linear redshift distortion is first derived by Kaiser (1987), 



where P'^(k) is the three-dimensional mass PS in redshift space, (3 2± fl'^-^^z) (Lahav et 
al. 1991), and /x^i is the cosine of the angle between the LOS and the wavevector k. The 



which is boosted with respect to P{k) by a factor of 1.9 at high redshift. The nonlinear 
redshift distortion reduces the power of Fourier modes along the LOS, and the reduction is 
more severe on smaller scales. These effects are illustrated in Fig. 5.7. Both the constant 
boost on large scales and the reduction of power on small scales can be seen from three- 
dimensional mass PS's at 2; = 3. Whereas, the nonlinear scale of redshift distortion has 
evolved beyond the size of the simulation box at 2; = 0, so that the monopole of the 
redshift-space PS is always below the real-space PS. The nonlinear redshift distortion 
is also reflected in Fig. 5.5, where a real-space structure of p/p ~ 160 is smoothed to 
p/p ~ 20 in redshift space. Since the nonlinear redshift distortion is equivalent to a 
small-scale filter, differences between PS's are expected to be smaller in redshift space 
than in real space. However, the agreement of Po{k) between baryons and dark matter 
at z = 3 is probably coincidental. 

The real-space one-dimensional PS of baryons can differ significantly from that of 
dark matter even on large scales, because the one-dimensional mass PS is an integral 
of the three-dimensional mass PS. The difference should be a constant on scales that 
baryons have exactly the same three-dimensional PS as dark matter. In redshift space. 



P^(k) = (l+/3/X^)2p(k), 



(5.13) 



monopole of P'^(k) is 




(5.14) 
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the one-dimensional mass PS is 

1 1"^ 

^lD(^3) = 4^y^ P^(k^,A;3)dk^. (5.15) 

The anisotropy of the redshift-space three-dimensional mass PS has made redshift dis- 
tortion less intuitive for the one-dimensional mass PS. 

5.1.2.2 Equation of State 

Fig. 5.8 demonstrates the correlation between temperature and baryon density for the 
simulation HYDRO 1. The temperature-density relation is fairly tight at z = 3, and it 
follows the EOS T = ro(pb/Pb)°'^ closely. Two theoretical temperature-density curves 
for ionization equilibrium at z = 3 are also provided in Fig. 5.8. The UV background 
intensities of the two curves are adjusted to fit mean fluxes of 0.77 and 0.64. Although the 
UV intensities differ by a factor of 2.5, there is no visible difference between the two curves 
at Pb/Pb < 5. Compared to the simulation and the curves of ionization equilibrium, the 
simple EOS is indeed a good approximation. 

At 2; = 0, the IGM develops multiple phases; the bulk of the IGM has become cooler, 
while some gases are shock-heated to temperatures up to a few times 10^ K — the WHIM. 
These WHIM gases are generally too hot and too dilute to produce appreciable Lya 
absorptions. They explain why many high-temperature regions at z = (see the first 
two panels of Fig. 5.2) do not correspond to any absorptions at all, even though high- 
temperature regions would be naively thought of as high-density regions from the simple 
EOS. 

Note that the temperatures and densities in Fig. 5.8 are randomly selected from the 
density grid. Since smoothing lengths of SPH particles arc required to be at least the 
size of the grid spacing, one will not get a high-density tail that is often seen in a particle 
temperature-particle density plot. The sharp edges of temperature-density distributions 
are due to the internal UV background of the simulation, which keeps SPH particles from 
cooling below the ionization-equilibrium temperature. 



55 




Figure 5.8: The temperature-density relation of baryons. Grey dots are for z = 3, and 

black dots z = 0. Each group consists of 4000 pairs of temperature and density values 
randomly selected from the density grid of HYDRO 1. The dashed line represents an EOS 
T = 10^(pb/Pb)'^'^ K. The solid lines are calculated from ionization equilibrium with a 
factor of 2.5 difference in the UV background intensity. 
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Table 5.2: Methods for generating the Lya forest. 



Method 
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Ion. Eq. 


DE 
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OC Pd 


ToiPb/PhT 


OC {ph/phV 



" is the temperature used for thermal broadening, Tsph is 
the grid temperature based on temperatures of SPH par- 
ticles in hydrodynamical simulations, and Tie(pb) is the 
ionization-equilibrium temperature of a gas with density pb- 



5.1.2.3 Variants of the Pseudo-Hydro Technique 

Separately, Petitjean et al. (1995) developed a slightly different pseudo-hydro technique. 
They assume baryons to trace dark matter but do not use the simple EOS. Temperatures 
and optical depths of baryons are calculated from ionization equilibrium in their method 
(labeled as DI). As seen in Fig. 5.8, the ionization-equilibrium temperature is usually 
lower than the density-weighted SPH temperature. As such, small-scale fluctuations may 
not be sufficiently filtered by thermal broadening in this method. 

One may devise another variant of the pseudo-hydro technique by applying the simple 
EOS and equation (5.10) to baryons in hydrodynamical simulations. I refer to this method 
as BE and that of Croft et al. (1998) DE. Similarly, the name BI is given to the full- 
hydro dynamical approach described in Section 5.1.1. One can assess the importance of 
shocks by comparing methods BE with BI, while the difference between methods BE 
and DE must arise from the difference between baryon distributions and dark matter 
distributions. The four methods are summarized in Table 5.2. 

5.1.3 Compctrison 

To give a visual impression of pseudo-hydro techniques, I show in Figs. 5.9 and 5.10 
Lya forests obtained along the same LOS using the four methods, BI, BE, DI, and DE. 
The mean flux over all 256^ Lya forests by the four methods are required to match the 
observed mean flux of 0.97 at z = and 0.71 at z = 3, but the mean flux of a single 
LOS is not necessarily the same across the methods. Since a simple EOS does not take 
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Figure 5.9: Transmitted Lya flux based on baryon and dark matter distributions at z = 0. 
The three panels compare the same Lya forest generated from baryons in ionization 
equilibrium (BI) to those from baryons with an EOS (BE), dark-matter-converted baryons 
in ionization equilibrium (DI), and dark-matter-converted baryons with an EOS (DE). 
All the four methods are required to reproduce the same mean flux of 0.97. 



into account the substantial amount of WHIM at ^; = (see Fig. 5.8), pseudo-hydro 
techniques are expected to be less accurate at lower redshift. This is seen in Fig. 5.9. 
On the other hand, at ^; = 3, methods involving ionization equilibrium and the EOS 
generate almost identical Lya forests, and the difference between Lya forests generated 
from baryons and those from dark matter is also small. 
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Figure 5.10: The same as Fig. 5.9, except that Lja forests are generated from baryon 
and dark matter distributions at z = 3 and the mean flux is 0.71. 
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Figure 5.11: Flux PS's of Lya forests at z = and z = 3. The Lya forests are produced 
using the four methods listed in Table 5.2. Grey bands represent la dispersions of flux 
PS's of Lya forests generated from baryons in ionization equilibrium. The dispersions 
are calculated among 1000 groups, each of which consists of 64 LOS's. 

Fig. 5.11 evaluates the statistical performance of pseudo-hydro techniques using flux 
PS's. Grey bands are la dispersions of flux PS's of Lya forests produced using the 
method Bl. The dispersions are calculated among 1000 groups, each of which contains 
64 LOS's randomly selected with no repetition. There is a good agreement among all 
methods at z = 3, but all the three pseudo-hydro methods, BE, DE, and DI fail to 
converge on BI at z = 0. The method DI seems to work better than the other two on 
scales k < 5 h Mpc~^ at z = 0. The excess of flux power for pseudo-hydro techniques 
is expected because they all underestimate the IGM temperature and produce more flux 
fluctuations than the full-hydro method (see Figs. 5.8 and 5.9). In fact, methods BI 
and BE have identical baryon distributions, so that the difference in their flux PS's can 
only be attributed to the IGM temperature. Hence, one concludes that the temperature 
structure of the IGM is critical to the Lya forest and flux PS at low redshift. 

The mean-density temperature of the IGM Tq does not affect the optical depth in 
methods BE and DE because it is absorbed into the parameter A in the approximation 
F = exp [—A{p/p)'^], which is adjusted to fit the observed mean flux. However, Tq can 
affect simulated Lya forests through thermal broadening as indicated by the fast drop 
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Figure 5.12: The same as Fig. 5.11, except that for methods BE and DE Lya forests are 
generated with Tq = 15000 K. Methods BI and DI are not affected by Tq. 



of the flux PS's for methods BE and DE. To test this, I reproduce Lya forests using 
To = 15000 K, which is roughly three times (1.5 times) the mean-density temperature 
of the IGM in HYDROl at z = (z = 3). Flux PS's of these Lya forests are shown in 
Fig. 5.12. One sees that the higher mean-density temperature reduces more flux power 
on small scales while leaving flux PS's unchanged on large scales. As such, the flux PS is 
less robust on small scales as a constraint for cosmology. The small-scale excess of flux 
power for the method DI with respect to BI is also due to thermal broadening because 
the ionization-equilibrium temperature is almost always lower than the density-weighted 
temperature of SPH particles. 
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5.2 Tuning the Lya Forest 

It is already seen in last section that pseudo-hydro techniques do not work well at low 
redshift and their performances are not all equal. Simulated Lya forests are affected by 
many elements including, for instance, the EOS (for methods BE and DE) and the mean 
flux. If they are to be compared with the statistics of observed Lya forests and to be 
used to determine cosmological parameters, one must understand the dependence of the 
statistics of simulated Lya forests on the above-mentioned elements. 

5.2.1 Equation of State 

The EOS maps density fluctuations to flux fluctuations by relating optical depths to 
densities. For a given density and mean flux, different EOS's will assign different optical 
depths, which alters amplitudes of flux fluctuations and, therefore, the flux PS. 

For a stiff EOS, i.e. a small value of 7, high-density regions have to absorb less Lya 
flux, while, in compensation, low-density regions have to absorb more. In terms of flux, a 
stiff EOS leads to higher fluxes in deep (or large-equivalcnt-width) absorptions and lower 
fluxes in shallow absorptions than a soft EOS does. This expectation is conflrmed in 
Figs. 5.13 and 5.14, where Lya forests generated using methods BE and DE are compared 
with those using the method BI at z = 0, 1, 2, and 3. The mean flux is kept the same 
for all the methods used here at each epoch, while only the EOS is varied. The value of 
7 = 1.4 in the figures corresponds to an unrealistically stiff EOS, and it is provided only 
for the purpose of comparison. 

Figs. 5.13 and 5.14 show that low-amplitude and small-scale fluctuations in the flux 
are likely to be suppressed for methods BE and DE, while high-amplitude fluctuations 
are likely to be amplified. Overall, this boosts the flux PS on large scales with respect to 
that of the method BI as observed in Fig. 5.11. Although methods BE and DE are not 
good approximations at low redshift, they work remarkably well with 7 = 1.6 at 2 = 3. 

Since the amplitude of flux fluctuations increases with 7 in Figs. 5.13 and 5.14, a 
smaller value of 7 must lead to a lower flux PS. This is observed in Fig. 5.15, where flux 
PS's of Lya forests obtained using the method DE with different EOS's are compared 
with those using the method BI. Methods DE and BE produce very similar Flux PS's, 
so flux PS's of method BE are not shown separately. Fig. 5.15 demonstrates that one 
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Figure 5.13: Lja Forests generated from baryon and dark matter distributions with 
different EOS's. The upper two panels are for z = 0, and the lower two z = I. The 
Lya forests are produced from baryon densities and dark matter densities using F = 
ex.p[—A{p/py], where A is adjusted to fit the mean flux F. Thermal broadening is 
included with temperature given by T = To(p/p)". 
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Figure 5.14: The same as Fig. 5.13, except that the upper two panels are for -Z = 2, and 
the lower two z = S. 
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Figure 5.15: Flux PS's of Lya forests generated from dark-matter-converted baryon 
densities using different EOS's. Grey bands represent la dispersions of the flux PS's 
of Lya forests generated from baryons in ionization equilibrium. The dispersions are 
calculated in the same way as in Fig. 5.11. 
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cannot tune the EOS to make pseudo-hydro techniques work at low redshift. Meanwhile, 
pseudo-hydro techniques appear to be a good approximation for studies of the flux PS 
at z = 3. The difference among different EOS's is also less pronounced at 2; = 3, because 
the dynamical range of p/p is much smaller. 

5.2.2 Mean Flux 

The mean flux affects the Lya forest and the flux PS in a simple way. Low-density regions 
of the IGM cannot absorb much Lya flux no matter what mean flux is required. Thus, 
the mean flux mostly affects regions where absorption is already significant. Specifically, 
a higher mean flux weakens existing absorptions and decreases the flux PS over all scales. 
The Lya forest should be more sensitive to the mean flux at lower redshift when Lya 
absorptions often arise from denser regions. 

Fig. 5.16 compares Lya forests obtained along the same LOS but with different mean 
fluxes given by observational and simulational uncertainties [see equation (5.4)]. Since 
all the four methods in Table 5.2 have the same dependence on the mean flux, I only 
show flux PS's of Lya forests generated using the full-hydro method. As expected, the 
mean flux monotonically alters Lya forests in all regions with greater changes in deeper 
absorptions, and a lower mean flux gives rise to stronger fluctuations in the flux. If one 
takes into account that the diflFerence in the mean fluxes at z = is about 20 times 
smaller than that at z = 3, the low-redshift Lya forest does seem to be more sensitive to 
the mean flux. 

Fig. 5.17 shows that flux PS's are also monotonically altered by the mean flux. A 
lower mean flux (more absorptions) leads to a higher flux PS on all scales. Unlike the 
EOS, the mean flux does not change the shape of the flux PS much. This implies that 
the mean flux can uniquely determine the normalization of the flux PS. In other words, 
the mean flux is a relatively robust constraint on simulated Lya forests. 
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Figure 5.16: Simulated Lya forests with mean fluxes from equation (5.4). The Lya forests 
are generated from baryons in ionization equilibrium. Note that the difference in mean 
fluxes increases with redshift. 
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Figure 5.17: Flux PS's of simulated Lya forests with mean fluxes from equation (5.4). 
The Lya forests are generated from baryons in ionization equilibrium. The grey band in 
each panel is the la dispersion of the flux PS with the medium mean flux. The dispersions 
are calculated among 1000 groups, each of which consists of 64 LOS's. 
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5.3 Mass Statistics vs. Flux Statistics 
5.3.1 Power Spectrum 

The Lya forest has been used to infer the hnear mass PS of the cosmic density field. The 
nonhnear transform of the density to the flux has made it difficult to derive the mass PS 
from the flux PS analytically. One way to circumvent the difficulty is to use simulations 
to map the flux PS to the mass PS (Croft et al. 2002). As such, it is important to compare 
the flux PS with the mass PS. 

Plotted in Figs. 5.18 and 5.19 are flux PS's of Lya forests produced from baryons 
and dark-matter-converted baryons in ionization equilibrium along with one-dimensional 
mass PS's of baryons and dark matter. One a dispersions of baryon flux PS's and mass 
PS's are shown in grey. The most prominent feature is that one-dimensional mass PS's 
have much larger dispersions than flux PS's. As discussed in Chapter 4, the variance in 
the one-dimensional mass PS is severely inflated by the trispectrum of the cosmic density 
field because of the non-Caussianity. 

An interesting observation is that unlike the mass PS the flux PS decreases with 
time. This is due to the thinning of the Lya forest and the higher mean flux toward 
lower redshift that reduce fluctuations in the Lya flux. 

The nonlinear transform between baryon density and flux greatly suppresses the fluc- 
tuations. For example, the overdensity 6 may vary from -1 to hundreds (tens) at z = 
(2 = 3), but the flux can only be between and 1. With a mean flux on the order of 
unity, fluctuations in the flux are 10-10^ times weaker than those in the cosmic density 
field. Hence, the fiux PS is a factor of 10^ {z = 3) to 10^ {z = 0) times lower than 
the one-dimensional mass PS. Moreover, the non-Gaussianity in the cosmic density field 
is even more suppressed in the flux because it is a higher-order effect. Thus, the flux 
trispectrum is much closer to zero as compared to the mass trispectrum of the cosmic 
density fleld, and the variance of the flux PS becomes much smaller than the variance of 
the one-dimensional mass PS. 

The near-Gaussian Lya flux is probably the reason that many simulations and tech- 
niques are able to reproduce lower-order statistics of the observed Lya forest, especially 
at high redshift. A potential problem arises because of Figs. 5.18 and 5.19. That is 
one could produce Lya forests from wildly different density fields but still have almost 



69 

= — I — I I I I I 1 1 1 1 — I — I I I I 1 1 1 r^p — I — I — I I I I 1 1 1 1 — I — I I I I I I 1 




k {h Mpc-i) 

Figure 5.18: Flux PS's of simulated Lya forests and one-dimensional mass PS's of the 
underlying density fields at z = and 1. 

identical flux PS. For instance, even though baryons and dark matter differ considerably 
in terms of mass PS (see also Fig. 5.7), they are not so distinguishable from each other 
in flux PS's at z > 2. Conversely, wc arc able to measure the flux PS extremely well, but 
how much confidence do we have in recovering the underlying mass PS? 

5.3.2 Covariance 

Higher-order statistics may be able to break the degeneracy. Here I employ the covariance 
of the one-dimensional PS to explore the difference between Lya forests generated using 
the full-hydro method BI and those using the pseudo-hydro method DI. 

Figs. 5.20 and 5.21 illustrate normalized covariances C{ks,k'^) and C{ks,k'^) of one- 
dimensional mass PS's and flux PS's, respectively, at z = 0. The covariances are calcu- 
lated from 1000 groups, each of which consists of 64 LOS's (iV = 64) randomly selected 
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Figure 5.19: Flux PS's of simulated Lya forests and one-dimensional mass PS's of the 
underlying density fields at ^; = 2 and 3. 

from the density grid of HYDROl. For GRFs, both C{ks, k'^) and C{ks, k'^) are diagonal 
with C{ks,k'^) = S^^^, and C{ks,k'^) = N~^5^^^,. For better comparison, the covari- 
ances C{k^, k'^) are multiplied by N, so that the Gaussian case has N C{kz-, k'^) = S^^ ^, . 
As already seen in Chapter 4, the covariances of one-dimensional mass PS's are starkly 
non-Gaussian. The normalized variances C{k3,k2) are two orders of magnitude higher 
than expected for GRFs. The covariances of baryons are roughly a factor of 2 lower than 
those of dark matter. This is likely due to the larger smoothing radii of SPH particles that 
filter out more small-scale non-Gaussian fluctuations. The covariances of flux PS's have a 
dominant diagonal, though they are still not Gaussian. The difference in the covariances 
between the full-hydro method BI and the pseudo-hydro method DI is comparable to the 
difference in their flux PS's. The method BI gives rise to stronger correlations between 
high-A; modes in the flux PS than the method DI does because of the WHIM. 
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Figure 5.20: Normalized covariances of onc-dimcnsional baryon and dark matter PS's at 
z = 0. The upper panels are covariances C{k3,k'^) in the same grey scale as Fig. 4.4. 
The lower panels are cross sections of covariances C{ks, k'^) along Q = {k^ + k'.^)/h~^M.^c. 
The covariances C(A;3, k'^) are multiplied by 64 for better comparison with that of GRFs, 
which follows 64C(A;3, k'^) = 5^^^^, . All the covariances are calculated from 1000 groups, 
each of which consists of 64 LOS's {N = 64) randomly selected from the density grid of 
HYDRO 1. 
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Figure 5.21: The same as Fig. 5.20, but for flux PS's of Lya forests generated from 
baryons and dark matter using methods BI and DI. 
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Figs. 5.22 and 5.23 present normalized covariances of one-dimensional mass PS's and 
flux PS's for the simulation HYDR02 at z = 3, which are very similar to those for 
HYDRO 1. At this redshift, the covariances of one-dimensional mass PS's are reduced by 
a factor of a few, but they are still highly non-Gaussian. Whereas, the covariances of flux 
PS's are very close to Gaussian, and the difference between the two methods BI and DI 
is greatly reduced compared to that at z = 0. 

In addition to simulations, I show in Fig. 5.24 normalized covariances of observed flux 
PS's at z = 3. The sample of Lja forests includes 20 QSO spectra from Bechtold (1994) 
and Dobrzycki & Bechtold (1996). The QSO spectra are selected so that each contains at 
least one good chunk of spectrum that has no bad pixels or strong metal lines and spans 
64 A anywhere within z = 2.9-3.1. The spectral resolution is 1 A, which is about four 
times lower than that in the simulations. In all, there arc 27 segments of Lyct forests for 
the analysis. I do not re-group the segments, i.e. N = 1, in calculating the covariances. 

The main characteristics of the observed covariances are in good agreement with 
simulated ones. Namely, the normalized covariance matrices have a strongly dominant 
diagonal, and they are very close to Gaussian. The values of diagonal elements seem 
to match those in simulations, but the off-diagonal elements are a little noisier because 
of the small sample size. With a large number of high-resolution Lya forests, one will 
be able to study the behavior of the covariance on smaller scales (larger k) and reduce 
statistical uncertainties. 

A general observation of covariances of flux PS's is that the correlation between two 
LOS modes decreases away from the diagonal, since two neighboring modes are more 
likely to be correlated than two distant modes. Beyond this point, however, the behavior 
of the covariances is not quantitatively understood. It also seems that the covariance of 
the flux PS does not provide more leverage for differentiating the underlying density field 
than the flux PS itself. This echos the finding by Mandelbaum et al. (2003) that the flux 
trispectrum does not provide much extra information than the flux PS. 
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Figure 5.22: The same as Fig. 5.20, but for the simulation HYDR02 at z = 3. 
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Figure 5.24: The same as Fig. 5.23, but for observed Lya forests. The covariances are 
calculated from 27 segments of Lya forests with = 1. Note that the resolution of the 
observed Lya forests is about four times lower than that in the simulations. 
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5.4 Cosmology 

Because of the difficulty in deriving density statistics from flux statistics, one often resorts 
to numerical simulations and constrains cosmology by comparing observed flux statistics 
directly to simulated flux statistics. In addition, one utilizes fast TV-body simulations 
and pseudo-hydro techniques in order to explore a large cosmological parameter space 
in manageable time. This necessitates an examination of the accuracy of pseudo-hydro 
techniques and the sensitivity of flux statistics to cosmology. 

Figs. 5.25 and 5.26 compare mass PS's and flux PS's for six simulations: HYDROl, 
HYDR02, LCDM, high-n LCDM (HIGHn), high-dg LCDM (HIGHct), and open CDM 
(OCDM). Table 5.1 lists parameters for all the models. The A?^-body simulations all 
have the same box size of 22.222 h~^Mpc on each side and evolve 128^ CDM particles 
from z = 49 to using GADGET. Note that the HIGHn model has an opposite tilt 
than HYDROl and HYDR02. Not all the simulations are consistent with most recent 
observations, and they are provided only for testing the cosmological dependence of the 
flux PS. 

Pseudo-hydro techniques have already been proven inaccurate at low redshift by sev- 
eral tests above. I include the results at z = here only to show that all the flux PS's 
based on the method DI are nearly indistinguishable from each other except the HIGHa 
model. 

At 2 = 3, the LCDM model is replaced by HYDR02. Interestingly, HYDROl and 
HYDR02 simulations have identical one-dimensional mass PS's and three-dimensional 
mass PS's. Their flux PS's differ on small scales. This is due to the factor that HYDR02 
has, on average, a higher IGM temperature, and it might not be directly resulted from the 
difference in cosmological parameters. There is also a considerable difference of flux PS's 
between A/"-body simulations and hydrodynamical simulations at A; < 1 ^ Mpc"^, but 
such difference is already present in Fig. 5.11 where pseudo-hydro techniques are applied 
to dark matter in the simulation HYDROl itself. Therefore, it cannot be attributed to 
the cosmological model. The only AT-body simulation that significantly deviates from 
HYDROl is OCDM— assuming that the method DI works equally well for OCDM. 

On scales below a few h~^Mpc {k > a few h Mpc^^), Fig. 5.12 suggests that detailed 
knowledge of the state of the IGM is needed in order to correctly reproduce the flux 



78 




k {h Mpc-i) k {b Mpc-') 

Figure 5.25: Mass PS's of baryons and dark matter and flux power spectra of simulated 
Lja forests for five cosmological models &t z = 0. The upper left panel shows one- 
dimensional mass PS's, and the lower panel three-dimensional mass PS's. The upper 
right panel shows flux PS's, and the lower panel residuals of flux PS's with respect to the 
flux PS of Lya forests generated with baryons in ionization equilibrium from HYDRO 1 
(light grey lines and bands). All others flux PS's are from AT-body simulations using the 
method DI. 

PS. On large scales, Fig. 5.26 reveals significant differences between cosmological models 
and between the full-hydro method and pseudo-hydro techniques. Therefore, to constrain 
cosmology using the flux PS and iV-body simulations, one should have precise calibrations 
of pseudo- hydro techniques and focus on scales above a few h^^Mpc. 

It is seen that the flux PS of the 27 Lya forest segments is roughly matched by all 
the simulations at z = 3. The scatter of the observed flux PS is too large to be useful for 
determining cosmological parameters because of the small size of the sample. But with 
a much larger number of Lya forests, it is possible to reduce the sample variance error 
of the flux PS and place meaningful constraints on cosmology. 



79 



ft 



Q 

or 1 



102 



0.1 



1 1 


1 1 1 1 1 


1 1 1 


1 1 1 1 1 1 1 

z = 3 - 




HYDRO 1 


Ba. 






HYDROS 


Ba. 






HIGHn 






"ill 


HIGHct 

OCDM 




W 


\ V 
' v\ 
- V 


1 1 ll[ 1 


1 — h-l-+ 


++++] h-H-^ 










1 1 


1 1 1 1 1 


1 1 1 


, ,mI Vk\ , = 



1 10 

k {h Mpc-') 




1 10 

k {h Mpc->) 



Figure 5.26: The same as Fig. 5.25 except that dark-matter-based PS's from the sim- 
ulation LCDM is replaced by baryon-based PS's from HYDR02, and that all PS's are 
at z = 3. The observed flux PS is measured from 27 segments of Lya forests between 
z = 2.9 and 3.1. 
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CHAPTER 6 



INVERTING THE LYa FOREST 

There are two ways of obtaining density statistics from the Lya forest. One is to measure 
flux statistics and then map them into density statistics. The other is to measure statistics 
of densities inverted from the Lya forest. As discussed in previous chapters, pseudo- hydro 
techniques are good approximations at z = 3, but they still require precise calibrations 
using hydrodynamical simulations. On the other hand, it is not practical to search 
the parameter space using hydrodynamical simulations that incorporate all important 
astrophysical processes. Therefore, it is worthwhile exploring methods of inverting the 
Lja forest. 

Baryon densities and mass densities p may be extracted from transmitted fluxes F 
of Lya forests using the same equation that is used in the pseudo-hydro technique DE, 
i.e. 

More accurately, one should also include ionization equilibrium, thermal broadening, etc. 
A question may be raised here: given the uncertainty of pseudo-hydro techniques, what 
can be gained by inverting the Lya forest using equation (6.1)? It is seen in Figs. 5.9 and 
5.11 that the lack of temperature information is the major source of error for pseudo- 
hydro techniques. Whereas, one could determine temperatures of observed Lya forests 
from line profiles and obtain relatively accurate baryon densities. Then, the mass PS of 
baryons on large scales can be used to constrain cosmology. 

When the density is high enough, the spectrum is saturated, i.e. F ~ 0. With noises 
and uncertainties in the spectrum, a direct inversion using equation (6.1) is very unreliable 
in saturated regions. Despite the difficulty, methods of direct inversion are systematically 
developed, for example, with Lucy's method by Nusser & Haehnelt (1999), and with 
Bayesian method for a three-dimensional inversion by Pichon et al. (2001). One may also 
use higher order lines to recover the optical depth and the underlying density (Cowie 
& Songaila 1998; Aguirre, Schaye &; Theuns 2002), even though the contamination by 
lower order lines needs to be carefully removed. Once LOS densities are obtained, many 
statistics, such as the one-dimensional mass PS, can be measured. 
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The saturation problem is avoided if one maps the mass PS directly from the flux 
PS of the Lja forest without an inversion (Croft et al. 1998, 1999, 2002). However, a 
close examination of the Fourier transform of equation (6.1) shows that Fourier modes 
on different scales are mixed by the nonlinear density-flux relation (see Section 6.3). The 
mixing depends on the underlying density field, and it is hard to predict analytically. 

If the inversion is necessary, a proper treatment in saturated regions has to be devel- 
oped. In many physical systems, sizes are often correlated with other quantities such as 
masses and densities. For example, more massive stars or dark matter halos have larger 
sizes, but lower mean densities (Binney 8z Merrifield 1998; Navarro, Frenk & White 1996). 
One may expect a similar trend for the saturated Lya absorption. On the contrary, the 
mean density is found to increase with the width of saturation. This is due to the fact 
that the IGM is very diffuse and far from virialization, while the other objects mentioned 
above are the opposite. 

In this chapter, densities are expressed in units of their corresponding cosmic mean 
densities, and the subscript for LOS wavenumber is dropped for convenience. 

6.1 Correlation between the Mean Density and Width of 
Saturated Lya Absorptions 

Neglecting the probability that two physically separate absorption systems fall in the 
same rcdshift coordinate, one can associate a saturated absorption in the Lya forest with 
a single high density region. If I assume further that there is no substructure present, and 
the IGM evolves more or less the same way everywhere, then the size of the saturated 
region has to be tightly correlated with its mean density (see also Schaye 2001). In reality, 
the neglected elements above and uncertainties elsewhere will introduce a spread to the 
correlation. 

6.1.1 Simulations 

Figs. 5.10, 5.19, and additional tests at z = 4 demonstrate that pseudo-hydro techniques 
work very well at 2; > 2. Therefore, I choose N-hody simulations to investigate the 
relation between the mean density and width of saturated Lya absorptions. 

A standard particle-particle-particle-mesh (P^M, Hockney & Eastwood 1981) code 
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Table 6.1: Parameters of the AT-body simulations. 



Model 


9. 


Ha 


/) 


r " 


0-8 


LCDM 


0.3 


0.7 


0.7 


0.21 


0.85 


OCDM 


0.3 





0.7 


0.21 


0.85 


SCDM 


1.0 





0.5 


0.5 


0.67 


TCDM 


1.0 





0.5 


0.25 


0.60 



" r is the shape parameter of the PS. 



developed by Jing & Fang (1994) is used to evolve 128^ dark matter particles in a cubic 
box of 12.8 h~^Mpc (comoving) each side. The initial PS is given by the fitting formula 
from Bardeen et al. (1986). The model parameters are listed in Table 6.1. All the models 
start from z = 15 and stop at z = 3 in 950 steps. 

The parameter A in equation (6.1) is chosen to fit the mean flux equation (5.4). I 
extrapolate the mean flux up to z = 4.5, which is not critical to the analysis in Section 
6.1.2 but is, nevertheless, supported by simulations (Riediger et al. 1998). The constant 
7 in equation (6.1) is set to 1.6 since it is the best fit in Fig. 5.14. Four samples of the 
Lya forest arc shown in Fig. 6.1. They arc drawn from the LCDM simulation at z = 3. 
The corresponding LOS densities are plotted along with the fluxes. It is evident that 
most of the Lya lines arise where p < 10. 

6.1.2 The Correlation 

I define the width w of the saturation as the distance between the two points that bracket 
the absorption at a given threshold flux level rj. In other words, it is the width of a region 
in which F < r). Since the flux F in the saturated region is dominated by noise, one 
should set the threshold above the noise level. That is why I have not added any noise in 
the simulations: noise outside the saturation can be routinely dealt with, while noise in 
the saturation, being below the threshold, does not matter. Within a reasonable range 
of spectral quality, I choose the threshold rj = 0.03 and 0.10 for tests below. 

Once the width is determined, the mean density pa can be readily calculated from the 
corresponding LOS density field. Since the density and the flux are assigned only on grid 
points, interpolation is needed to find w and pa. Fig. 6.2 shows the scaling relation for the 
LCDM model at z = 3. It clearly demonstrates that pa increases with w. Furthermore, 
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Figure 6.1: Simulated Lya forests and LOS densities from the LCDM model at z = 3. 
The solid lines are the flux F, the dashed lines are the matter density p, the dotted lines 
are the recovered density, and the horizontal dash-dotted lines indicate the threshold flux 
level rf = 0.03 (see Section 6.2) in the upper 2 panels, and ry = 0.10 in the lower 2 panels. 
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Figure 6.2: The correlation between the mean density pa and width w of saturated regions 
in the Lya forest. The trend Hues fit the data well beyond 2 /i^^Mpc, even though they 
are obtained by fitting only the data with w < 2 h~^Mpc. The data above 0.3 /i~^Mpc 
are binned in logarithmic intervals for readability. Similar treatments apply to Figs. 6.4, 
6.5, 6.6, and 6.9 as well. 
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Table 6.2: Parameters in the fitting p^, = po + a {w/h ^Mpc)** for the LCDM model. 



z 


r? 


Po 


a 


b 


3 


0.03 


1.93 


3.85 


0.545 




0.10 


1.39 


3.39 


0.564 


4.5 


0.03 


0.11 


2.78 


0.455 




0.10 


0.24 


2.04 


0.556 



the correlation is reasonably tight for narrow saturations. Note that the trend lines are 
obtained by fitting only the data with w <2 /i~-'^Mpc, because wider saturations are very 
rare as compared to the rest. 

The relation is well fitted by 

Pa = + a (w;//i"^Mpc)^ (6.2) 

The term po sets a baseline for the scaling, because the matter density is non-vanishing 
even if there is no saturation. The parameter a is a scaling coefficient, which reflects the 
overall amplitude of the density fluctuation, and the exponent h is more or less determined 
by the nature of hierarchical clustering. The width w is in units of /i~^Mpc. Table 6.2 
lists the values for the LCDM model at 2; = 3 and 4.5. The reason why po decreases with 
redshift is that the universe is more uniform early on, so that even low density regions 
have to absorb a substantial amount of Lya flux to produce the low mean fiux. This 
is possible because the neutral fraction of the IGM at z = 4.5 is higher than that at 
z = ?). As the universe evolves, density fluctuations grows stronger and stronger, and the 
scaling coefficient a becomes larger and larger. The exponent h has changed little over 
z = 3-4.5. The parameters also show a dependence on the threshold flux rj, because rj 
sets the threshold density p^, above which the Pa^ relation is explored. 

Fig. 6.3 shows the evolution of the Pa,-w relation for the four models. The difference 
between models is mostly due to the amplitude of fluctuations — in other words, the PS. 

6.1.3 A Simple Model 

The Pa^ relation is analogous — but not completely equivalent — to the curve of growth 
(e.g. Press & Rybicki 1993), which studies the correlation between the Hi column density 
A^Hi and the equivalent width W of Lya absorptions. The similarity is as follows. The 
width w is approximately the same as W, and the mean matter density pa is proportional 
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Figure 6.3: Best-fittings of the pa-w relation for the four models at z = 3-4.5. The 
threshold rj = 0.03. The parameters are obtained by fitting the saturations with w < 
2 ^-^Mpc. 

to w^^Nui if p oc pb holds true. Since A^hi oc at small values of T^, it is not surprising 
to see Pa grow slowly, i.e. dlnpa/dlnii; ~ 0, for small w. 

On the other hand, the Pa^ relation addresses saturated absorptions where, accord- 
ing to the curve-of-growth analysis, W is almost a constant independent of the H I column 
density. Therefore, pa would have risen steeply against w, i.e. 6 S> 1. This apparent in- 
consistency arises from the cosmological context, because the width w is determined not 
only by A^^hi but also by the physical extent of the (relatively) dense region through 
Hubble expansion and peculiar velocities. 

Although it is not obvious why the exponent b is less than unity, I can make an order- 
of-magnitude estimate using the density profile p(r) oc from a spherical self-similar 
infall. I modify the profile to avoid the singularity at r = by adding a smoothing length 
e, so that p(r) oc (r^ -|- e^)~^/^. The mean density within the radius R, at which p = p^. 



87 



IS 



R\/B? + e2 + log 



(6.3) 



Prom equation (6.3), one gets an estimate 6 ~ dlnpa/dlni? = 0.39, 0.42, and 0.43 for 
e = 0.2, 0.3, and 0.4 /t~^Mpc respectively at i? = 2 /i~^Mpc. The same quantity for the 
PaT^ relation is dlnpa/dlnii; = 0.41, where I have used the parameters of the LCDM 
model at z = 3 with -q = 0.03 (see Table 6.2). It should be emphasized that the profile 
p(r) oc r"^ is not quite justified for the IGM at 2; = 3, and without any modification, it 
gives the same mean density within the boundary p = Prj regardless the size of the system, 
i.e. 6 = 0. Therefore, equation (6.3) is not expected to give a good approximation of the 
Pa.-w relation. 

6.2 Inversion with A Gaussian Density Profile 

If the flux level is above the threshold, equation (6.1) can be used to find the density. 
When it is below the threshold, one must provide a density profile that matches P3,{w) 
and prf to fill in the missing information in the saturation. 

While the density profile is worth studying in its own right, I simply choose the 
Gaussian profile 

•S0x2 



P{s) 



B 



exp 



-(- 

2^ a 



-y 



(6.4) 



aV2^ 

where s is the coordinate in redshift space, sq is the center of the saturation, and B and 
a are solved simultaneously from 

(•so+w/2 



ps.{w) = - p(s)ds = — erfl — =- 1, 

wJsa-w/2 w V2V2 0-/ 



Pn = p{so ± w/2) 



(6.5) 



B 



: exp 



8 



a^/2^^ 

where erf(a;) is the error function. The Gaussian profile has the advantage that it does 
not introduce any artificial power on small scales. However, it is arguable that the right 
amount of small-scale power should be added through the profile, and so a more realistic 
profile may be needed. 
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Figure 6.4: One-dimensional mass PS's of original densities and inverted densities in 
redshift space. Circles include all LOS's from the N-body simulation, and the dashed 
line contains only the ones that have a maximum width of saturation wm < 1 h~^Mpc. 
The recovered densities are inverted from fluxes with thresholds rj = 0.03 (solid line) and 
77 = 0.1 (dotted line). The error bars are la dispersions among 610 groups, each of which 
consists of 20 LOS's. The error bars of the recovered densities, which are not plotted, 
are comparable to that of the original densities. 



The recovered LOS densities with 77 = 0.03 and rj = 0.10 are shown along with the 
four original LOS densities and fluxes in Fig. 6.1. Since a universal density profile is 
employed in the inversion, the recovered densities do not necessarily match the original 
densities. 

To assess the statistical quality of the inversion, I plot in Fig. 6.4 the original and 
recovered mass PS's of the LCDM model with different flux thresholds. Other models are 
shown in Fig. 6.5. The recovered PS has good agreement with the original PS on large 
scales (fc < 3 hMpc~^), but it is underestimated on smaller scales, where the Gaussian 
profile essentially has no power. The signal to noise ratio, or the threshold flux, has little 
influence on large scales, but a low noise level does slightly improve the recovery on small 
scales. 
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Figure 6.5: The same as Fig. 6.4, but for OCDM, SCDM, and TCDM models. 
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Figure 6.6: The same as Fig. 6.4, except that the fluxes are drawn from the LCDM 
simulation, while the densities are recovered with pa,-^ relations from OCDM and TCDM 
simulations. 



For comparison, I show in Figs. 6.4 and 6.5 the original PS of the LOS densities that 
have a maximum width of saturation wm < 1 /i~^Mpc. This is equivalent to removing — 
or giving less weight to — the saturated regions when one measures the PS. It is seen that 
the saturated regions are very important to scales > 2 ^~^Mpc, while the unsaturated 
regions give a good estimate of the small-scale power. Thus one may improve the recovery 
of PS as follows. First, invert the Lya forest with equation (6.1) and the Pa,-w relation. 
Second, do the inversion after removing the saturated regions. Finally, the best-estimate 
of the PS is just the common envelope of the PS's of the two recovered densities. 

Fig. 6.3 indicates that the Pa,-w relation varies from model to model. Thus it is 
necessary to check if the recovery is sensitive to Pa,{w) — in other words, if it is model 
dependent. In Fig. 6.6 I plot the PS's of LCDM densities recovered with Pa,,o{w) and 
Pa,T(w) from OCDM and tilted cold dark matter (TCDM) models respectively. It seems 
that by boosting pe,{w) a small amount [pa.,T{w) lies a little higher than /)a,L('fi')]; one 
gets even better estimate of the PS on small scales. However, lowering Pa,{w) could 
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underestimate the power by a factor of 2 on large scales, and even more on small scales. 
The effect of this model dependence could be reduced with the constraint on small scales, 
since it is possible to recover the small-scale power well by removing the saturations. 

Fig. 6.7 tests a different statistics — the one-point distribution function for the LCDM 
model. It is evident that the Gaussian profile leads to a drop of the probability at high 
densities. In other words, it statistically reduces the heights of density peaks. 



6.3 Mapping the Power Spectra 

It seems that the inversion is no longer needed at least for determining the mass PS if 
one establishes a direct mapping between the flux and the mass PS's (Croft et al. 2002). 
It is shown that given a set of cosmological parameters, there is a statistical mapping, 
which reliably recovers the mass PS from the flux PS (Gnedin &; Hamilton 2002), even 
though the mapping is model dependent. 

The physical links between the three-dimensional linear mass PS Pi,{k) and the flux 
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Figure 6.8: Three-dimensional mass PS's of the LCDM model at z = 3. The dashed lines 
are the real-space PS's, the dotted lines are the redshift-space PS's averaged over solid 
angles, and the thin solid line is the initial PS (at z = 15) evolved to z = 3 by linear 
theory. The thin dashed line and the thin dotted line are from an additional simulation 
with the same parameters as the LCDM model except that the box is 51.2 h~^Mpc each 
side. 

PS can be summarized by the flowchart: 

where P^hik) is the three-dimensional nonlinear mass PS (Peacock &; Dodds 1996; Taylor 
& Hamilton 1996), P^i^{li) is Pnl(A:) in redshift-space (Kaiser 1987; Peacock 1999; Zhan 
& Fang 2003), and (f{k) = PF{k)/Pf-^{k). Note that P^j^{\s.) is anisotropic. I show in 
Fig. 6.8 that the nonlinear evolution and redshift distortion are already significant at 
z = 3. The fact that the departure from linear evolution is very similar in both large 
and small box simulations indicates that such departure is real, and the cosmic density 
field has already gone nonlinear below 10 h~^Mpc {k > 0.6 h Mpc~^) at z = 3 (see 
also Pichon et al. 2001; Zhan et al. 2001). The angularly averaged three-dimensional PS 
-^nl(^)' monopole, does not give a complete view of the difference between the 
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real-space PS and redshift-space PS, but it does show that peculiar velocities boost the 
power on large scales and reduce it substantially on small scales as seen in Fig. 5.7. A 
two-dimensional projection of PffL(k) can be found in Peacock et al. (2001). 
In contrast, the mapping in Croft et al. (2002) follows a simplified path: 

where Pid,l(A;) = {21^)-^ I\,{y)ydy, and h'^{k) = -Ft(A;)/PiD,L(A;). Given a one- 
dimensional density field in redshift space at z = 3, one can only measure Pfj){k), and so 
Pid,l(A;) is not observable unless the density field is linear on all scales and no peculiar 
velocity is present. In other words, Pf^ik) is directly connected to Ppik), while PiD,L(fc) 
is not. 

Around the cosmic mean matter density, i.e. \6\ <C 1, equation (6.1) is approximately 

F{s) ~ e-^ - A-fe-^S{s), (6.6) 

where I have included the dependence on the redshift coordinate s explicitly. It is seen in 
the linearized equation (6.6) that the Fourier modes of the flux are proportional to those 
of the overdensity when k ^ 0. Hence, the flux PS is proportional to the mass PS 

ft(A;) ~ AVc^^^AdW, k^O. (6.7) 

Caution should be taken when using equation (6.6), because it is valid only if one smoothes 
the Lya forest over large scales (see, for example, Zhan et al. 2001; Zhan & Fang 2002). 

It is obvious from equation (6.1) and Fig. 6.1 that the condition \S{s)\ ^ 1 is not 
satisfied when P 2± 1 or F ~ 0. Therefore, one has to include higher order terms of 
S{s). Taking the simplest case, in which S{s) = sin(A;os) and only S^{s) term is added to 
equation (6.6), one immediately finds that Ppik) contains spurious power on the mode 
2ko that Pfj){k) does not contain. In general, the nonlinear density-flux relation distorts, 
spreads, and mixes power in the cosmic density field over different scales in the flux 
PS. I have not focused other sources of distortion such as linear filtering (Gnedin &; Hui 
1998), line profile (see Gray 1992), and instrumentation, since they are well studied in 
the literature. 

Fig. 6.9 shows the ratios b{k) and d{k). The dispersion in d{k) is larger than that 
of b{k), because both Pf(^) and Pfj^{k) contribute to the dispersion of d{k), while b{k) 
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Figure 6.9: The ratio b{k) = ^/PF{k)/PlD,L{k) (upper panel) and d{k) = sJPF{k)/Pf^{k) 
(lower panel). The Lyct forests are divided into groups, each of which consists of 20 LOS's. 
The PS's are averaged within each group except PiB,hik), which is theoretical. The error 
bars of the LCDM model are la dispersions among 610 groups. Error bars of other 
models are comparable to that of the LCDM model. 
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receives only a single contribution from Ppik). Even so, the error propagation would 
cause a large scatter in determining PiB,i,{k) and PL{k). In fact, the scatter can be large 
enough that one would not be able to determine the cosmological model based only on the 
recovered Pl(^)- On the other hand, the model dependence of b{k) is strong enough, so 
that one may have to assume a cosmological model to recover the mass PS from the flux 
PS. The difference among the models is not solely due to erg. For example, the standard 
cold dark matter (SCDM) model has a slightly higher as than the TCDM model, so one 
would expect b{k) of the SCDM model to have the same shape as that of the TCDM 
model. This is not observed in Fig. 6.9, however. It is also evident from the behavior 
of b{k) and d{k) that the PS of the underlying density field is significantly lower 

than the linear mass PS Pid,l(^) on scales k > 0.02 (km s~^)~^. This is due mostly to 
peculiar velocities, or redshift distortion. 
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CHAPTER 7 



CONCLUSIONS 

I have investigated the covariance of the one-dimensional mass PS for both GRFs and 
simulated density fields. The nonlinear evolution and non-Gaussianity of the cosmic 
density field on small scales (see also Zhan et al. 2001; Smith et al. 2003; Zaldarriaga et 
al. 2003; Zhan 2003) have caused correlations between the fluctuations on different scales 
and increased the cosmic variance of the one-dimensional mass PS. Because of this, a 
large number of LOS's are needed to accurately measure the one-dimensional mass PS 
and recover the three-dimensional mass PS. 

The length of LOS's introduces a window function in the LOS direction, which mixes 
neighboring Fourier modes in the cosmic density field. Fig. 4.8 lias demonstrated for 
simulations that the length of LOS's does affect the eovariance of the one-dimensional 
mass PS. The covariance of the observed one-dimensional mass PS will receive a significant 
contribution from this effect, because in practice the length of LOS's suitable for study 
is always much less than the size of the observable universe [see equation (3.11)]. 

One may reduce the cosmic variance by binning independent Fourier modes. How- 
ever, since the modes of the cosmic density field are strongly correlated, binning will 
be less effective in reducing the sample variance error. On the other hand, the non- 
Gaussian behavior of the covariance provides important information of the field such as 
the trispectrum. 

Using hydrodynamical simulations and A?^-body simulations, I find that pseudo-hydro 
techniques are able to reproduce the flux and flux PS that are obtained using the full- 
hydro method at z >2. There is also a good match between observed and simulated flux 
statistics such as the flux PS and its covariance at z = 3. 

The accuracy of pseudo-hydro techniques does not seem to be high enough for de- 
termining cosmology at percent level. One needs to precisely calibrate pseudo-hydro 
techniques with hydrodynamical simulations. Moreover, it is better to constrain cosmol- 
ogy using the flux PS on scales above a few ^"^Mpc in order to reduce the uncertainties 
caused by the IGM temperature and thermal broadening. At low redshift, the shock- 
heated WHIM has greatly altered the temperature-density relation of the IGM, so that 
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fluctuations in the Lja flux axe strongly suppressed by thermal broadening. As a result, 
pseudo- hydro techniques overpredict the flux PS at low redshift. 

The transform from density to flux quenches fluctuations by orders of magnitude and 
leads to near-Gaussian Lya fluxes. Hence, the variance of the flux PS is much less than 
that of the one-dimensional mass PS. In other words, one can measure the flux PS to a 
high precision with a relatively small number of LOS's, but the underlying mass PS cannot 
be determined as precisely as the flux PS. Attempts to recover the three-dimensional mass 
PS should take into account the sample variance error of the one-dimensional mass PS. 

It is possible to invert the Lya forest and measure statistics of one-dimensional density 
fields directly. I provide a scaling relation between the mean matter density pa and the 
width of saturated Lya absorptions w, which helps invert saturated Lya absorptions. 
The inversion with the ps.^w relation is able to recover the rcdshift-spacc one- dimensional 
mass PS fairly well on scales above 2 h^^Mpc, but it underestimates the power on small 
scales due to the use of the Gaussian profile. An improvement is suggested based on 
the observation that the small-scale one-dimensional mass PS can be better recovered 
if saturated absorptions are removed. Thus, by combining both the Pa.-w inversion and 
the result after removing saturated absorptions, one could get a good estimate of the 
one-dimensional mass PS over a wide range of scales. 

The PaT^ relation provides an important constraint to the inversion of the Lya forest. 
To incorporate it in any inversion scheme, one needs to determine the Par^ relation 
according to spectral resolution and noise. The threshold flux can be conveniently set to 
a small value above the noise level. The recovered redshift-space one-dimensional mass 
PS is not very sensitive to the p^-w relation on large scales. However, one would greatly 
underestimate the large-scale one-dimensional mass PS if saturated regions were excluded 
from the sample. 

As seen in Chapters 4 and 5, one-dimensional density fields have a large cosmic 
variance in their PS due to non-Gaussianity. Nevertheless, this variance can be reduced 

by a large number of LOS's. With thousands of Lya forests from QSO surveys, it is 
possible to subdue the variance to a few percent level, and the Lya forest may eventually 
become comparable to other fields of precision cosmology such as the CMB and weak 
lensing. 
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